home.social

#aicodingagent — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #aicodingagent, aggregated by home.social.

  1. Grok Build: xAI's Local-First Coding Agent with 8 Parallel Agents and Arena Mode — Complete Guide (April 2026)

    xAI's Grok Build runs up to eight parallel AI agents simultaneously on a single prompt, evaluates outputs algorithmically via Arena Mode, and keeps all source code on-device. Th...

    wowhow.cloud/blogs/grok-build-

    #wowhow #grokbuild #xai #aicodingagent

  2. An inside look at NanoAgent, an open-source AI coding agent built around developer workflows, permissions, and observable execution. hackernoon.com/building-an-ai- #aicodingagent

  3. “A 30-hour timeline of how Cursor's agent, Railway's #API, and an industry that markets #AISafety faster than it ships it took down a small business serving rental companies across the country. I'm Jer Crane, founder of PocketOS. We build #software that rental businesses, primarily car rental operators, use to run their entire operations: reservations, payments, customer management, vehicle tracking, the works. Some of our customers are five-year subscribers who literally cannot operate their businesses without us. Yesterday afternoon, an #AICodingAgent, #Cursor running #Anthropic's flagship #ClaudeOpus 4.6, deleted our production database and all volume-level backups in a single API call to Railway, our infrastructure provider.

    It took 9 seconds.

    The agent then, when asked to explain itself, produced a written #confession enumerating the specific safety rules it had violated.”

    When you use a cheap-arse #DBA.

    #AI / #WhiteCollar / #ZeroHourWork source <x.com/lifeof_jer/status/204810> comments <news.ycombinator.com/item?id=4>

  8. #Google launched its #AIcodingagent, #Jules, out of beta. Jules, powered by #Gemini 2.5 Pro, is an #asynchronous coding tool that integrates with #GitHub and uses #AI to #fix or #update #code. The tool received structured pricing tiers, including a free plan, and updated privacy policy. techcrunch.com/2025/08/06/goog #tech #media #news

  9. Researcher spots a critical prompt‑injection flaw in Cline AI’s coding agent (Claude‑based). The bug lets attackers run arbitrary code via GitHub Actions, exposing a serious AI vulnerability. Open‑source devs should watch out and consider mitigations. Read the full breakdown to see how the exploit works and what to do next. #ClineAI #PromptInjection #AICodingAgent #Cybersecurity

    🔗 aidailypost.com/news/hacker-ex

  10. Let me try to explain what frustrates me about #AICodingAgent generated (or assisted) PRs by example. This is just one example, but it's quite typical of what I see a lot:

    https://github.com/silverbulletmd/silverbullet/pull/1731

    First of all: a very elaborate PR description that ostensibly sounds like some deep analysis happened here. I'm not sure what the original prompt was, but I suspect (based on some others by the same author, all of which I closed) he has some "magic prompts" along the lines of "find performance bottlenecks and fix them."

    And lo and behold, Claude found one (and probably more, which are shared with yours truly in the 17 other PRs that this author opened):

    "Syscalls: Reduced from 4 → 3 per write (25% reduction). This optimization is in the critical user latency path - every file save operation hits this code."

    Now you will probably think: sounds reasonable, thank you!

    But... is this REALLY a critical user latency path? Every file save does hit this path, but how many of those happen, really? SilverBullet is a single-user app, and saves happen (with a sync lag) at most every few seconds if you're actively editing and connected to the Internet. Is this a path worthy of even a minute of performance optimization? I can think of hundreds that would be way more interesting. But here we are.

    Now this PR also adds a full benchmark suite to prove its claims. To be honest, I haven't actually looked at this code, because it's pretty irrelevant: I don't feel there's a performance issue to be resolved here at all. I also haven't checked whether all those stats in the PR are actually accurate. Again, doing so would take time, which I would consider a 100% waste.

    But here's the kicker: this PR actually introduces possibly two bugs in the ~10 lines that it actually changes. One is a definite bug (as I comment in the PR): it sets the creation time of a file to be the modified time, which is just wrong, but actually the only "sensible" thing you can do to avoid making the syscall which this PR eliminates.

    The second, more subtle bug is that it introduces a discrepancy between the OS-reported file modified timestamp and the "unix clock" one, which it claims is "a few microseconds at most", but that's likely not true (and very filesystem-dependent), and ANY discrepancy will mean that the sync engine breaks, because it uses those last-modified timestamps to check for changes.

    And here's the thing. Me explaining this, thinking about it, commenting on it took likely 10x more time than the author spent on producing this piece of art. It doesn't solve an actual problem, it adds 150 lines of useless benchmarks and to top it all off introduces 2 bugs. In this case I think there's no actual way to do this properly, the PR cannot be fixed, it is just based on a wrong analysis. But in many other cases it's possible to get it to some place "good" as in: correct, but STILL it would be a waste of time, because the problem doesn't exist or at least is not worth addressing.

    sigh
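The timestamp concern in the post above can be illustrated with a minimal Python sketch. It is not SilverBullet's code (the function names `save_with_stat`, `save_with_clock`, and `looks_changed` are hypothetical); it only assumes a sync check that treats any difference between a recorded modification time and the OS-reported one as a change.

```python
import os
import tempfile
import time


def save_with_stat(path: str, data: bytes) -> int:
    """Write the file, then ask the OS for its modification time (ns)."""
    with open(path, "wb") as f:
        f.write(data)
    return os.stat(path).st_mtime_ns  # the extra syscall the PR wanted to avoid


def save_with_clock(path: str, data: bytes) -> int:
    """Write the file, then read the wall clock instead of calling stat."""
    with open(path, "wb") as f:
        f.write(data)
    return time.time_ns()  # a guess, not the value the filesystem stored


def looks_changed(path: str, recorded_mtime_ns: int) -> bool:
    """Hypothetical sync check: any timestamp mismatch counts as a change."""
    return os.stat(path).st_mtime_ns != recorded_mtime_ns


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "note.md")

        recorded = save_with_stat(path, b"hello")
        print("stat-based record flags a change:", looks_changed(path, recorded))

        guessed = save_with_clock(path, b"hello again")
        drift_ns = guessed - os.stat(path).st_mtime_ns
        print("clock-based drift in ns:", drift_ns)
```

The size of the drift depends on the filesystem's timestamp granularity and on how the kernel stamps writes, so the sketch prints it rather than asserting a value. Under the assumed equality check, any nonzero drift is enough to make an untouched file look modified.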

  11. #Salesforce launched #AgentforceVibes, an #AIpowered developer tool that helps developers work autonomously on Salesforce apps and agents. The tool includes an autonomous #AIcodingagent, #VibeCodey, which is connected to a company’s existing Salesforce account, reusing existing code and following coding guidelines. techcrunch.com/2025/10/01/sale #tech #media #news

  12. #OpenAI released #GPT5Codex, an upgraded version of its #AIcodingagent, #Codex. The new model, available to ChatGPT Plus, Pro, Business, Edu, and Enterprise users, offers improved performance on coding tasks due to its dynamic “thinking” abilities. OpenAI aims to make GPT-5-Codex available to API customers in the future. techcrunch.com/2025/09/15/open #tech #media #news

  13. Ah, yes, because nothing says "cutting-edge tech" like juggling Git worktrees and #Tmux while your AI coding agent goes "brrr" 🙄. Truly groundbreaking stuff: discovering #parallelization in 2024 like it's a rare species. 🚀🔧
    skeptrune.com/posts/git-worktr #cuttingEdgeTech #GitWorktrees #AICodingAgent #HackerNews #ngated

  14. Introducing ayder-cli, a local coding agent that runs smoothly with Ollama and Qwen3-Coder. It uses XML instead of JSON to avoid errors, makes compact edits, and supports code search with ripgrep. It processes tasks automatically through Markdown files and stays safe by asking for confirmation at each step. It suits Apple Silicon Macs or a powerful GPU. Free to use, with no worries about running out of tokens.
    #AICodingAgent #Ollama #Qwen3Coder #DeveloperTools #aydercli #CôngCụLậpTrình #TríTuệNhânTạo #AIĐịaPhương #LậpTrìnhMáyHọc #CodeAssistant

    reddit.com/r/LocalLLaMA/commen
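As a rough illustration of the two ideas in the post above (XML-style tool calls instead of JSON, and confirmation before each step), here is a minimal Python sketch. It is not ayder-cli's actual format or code: the `tool_call` and `param` tag names and the `parse_tool_call` and `confirm` helpers are assumptions made up for this example. The point is that code placed between tags needs no quote or newline escaping, which is where model-generated JSON tends to break.

```python
import re
from typing import Callable, Dict, Optional, Tuple

# Hypothetical tag names, chosen for illustration only.
TOOL_CALL = re.compile(
    r'<tool_call name="(?P<name>[\w-]+)">(?P<body>.*?)</tool_call>', re.DOTALL
)
PARAM = re.compile(
    r'<param name="(?P<key>[\w-]+)">(?P<value>.*?)</param>', re.DOTALL
)


def parse_tool_call(text: str) -> Optional[Tuple[str, Dict[str, str]]]:
    """Return (tool name, parameters) for the first tool call in text, or None."""
    match = TOOL_CALL.search(text)
    if match is None:
        return None
    params = {
        p.group("key"): p.group("value")
        for p in PARAM.finditer(match.group("body"))
    }
    return match.group("name"), params


def confirm(name: str, params: Dict[str, str],
            ask: Callable[[str], str] = input) -> bool:
    """Ask the user before running a tool; anything other than 'y' declines."""
    answer = ask(f"Run {name} with {sorted(params)}? [y/N] ")
    return answer.strip().lower() == "y"


if __name__ == "__main__":
    reply = (
        "I will update the file.\n"
        '<tool_call name="write_file">'
        '<param name="path">hello.py</param>'
        '<param name="content">print("hi")\nprint(\'bye\')\n</param>'
        "</tool_call>"
    )
    print(parse_tool_call(reply))
```

The sketch uses regular expressions rather than a strict XML parser so that `<` or `&` inside code does not cause a parse error. The trade-off is that a parameter value containing a literal `</param>` would be cut short.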

  15. Anthropic leaks Claude Code Source Code: $340 billion Anthropic that wiped trillions from stock market worldwide has source code of its most-important tool leaked on internet

    AI firm Anthropic suffered a significant sour…
    #NewsBeep #News #Artificialintelligence #AI #AIcodingagent #Anthropic #AnthropicAI #AntrophicIPO #Antrophicleak #ArtificialIntelligence #AU #Australia #Claude #ClaudeCapybara #claudecode #Claudecodesourcecodeleak #ClaudeMythos #Technology
    newsbeep.com/au/578026/
