home.social

#aihallucination — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #aihallucination, aggregated by home.social.

  1. Ah, yet another groundbreaking revelation: AI models "hallucinate"—as if we didn't already know they embellish like a caffeinated novelist. 🤪 Who would've guessed that predicting the future involves a little guesswork? Maybe the next paper will enlighten us on water being wet. 🧐
    arxiv.org/abs/2401.11817 #AIhallucination #AImodels #TechHumor #PredictingTheFuture #GroundbreakingRevelation #HackerNews #ngated

  2. Ah, yet another groundbreaking revelation: AI models "hallucinate"—as if we didn't already know they embellish like a caffeinated novelist. 🤪 Who would've guessed that predicting the future involves a little guesswork? Maybe the next paper will enlighten us on water being wet. 🧐
    arxiv.org/abs/2401.11817 #AIhallucination #AImodels #TechHumor #PredictingTheFuture #GroundbreakingRevelation #HackerNews #ngated

  3. Ah, yet another groundbreaking revelation: AI models "hallucinate"—as if we didn't already know they embellish like a caffeinated novelist. 🤪 Who would've guessed that predicting the future involves a little guesswork? Maybe the next paper will enlighten us on water being wet. 🧐
    arxiv.org/abs/2401.11817 #AIhallucination #AImodels #TechHumor #PredictingTheFuture #GroundbreakingRevelation #HackerNews #ngated

  4. Ah, yet another groundbreaking revelation: AI models "hallucinate"—as if we didn't already know they embellish like a caffeinated novelist. 🤪 Who would've guessed that predicting the future involves a little guesswork? Maybe the next paper will enlighten us on water being wet. 🧐
    arxiv.org/abs/2401.11817 #AIhallucination #AImodels #TechHumor #PredictingTheFuture #GroundbreakingRevelation #HackerNews #ngated

  5. Ah, yet another groundbreaking revelation: AI models "hallucinate"—as if we didn't already know they embellish like a caffeinated novelist. 🤪 Who would've guessed that predicting the future involves a little guesswork? Maybe the next paper will enlighten us on water being wet. 🧐
    arxiv.org/abs/2401.11817 #AIhallucination #AImodels #TechHumor #PredictingTheFuture #GroundbreakingRevelation #HackerNews #ngated

  6. Elon shared that Grok 4.20 hits 83% on "non-hallucination" vs Claude's ~74%. Hallucination = when AI confidently makes up facts. Like saying Paris is Italy's capital or inventing fake research studies. 83% accuracy means it's still wrong 1 in 5 times. Better than most AIs, but you can't trust it blindly for important decisions yet. #AI #Grok #AIHallucination

  7. ...This situation goes a step above mere AI hallucinations into what the researchers call an AI “mirage.” Unlike the generative AI errors we’ve come to expect, the AI mirage is incredibly rational from start to finish...

    futurism.com/artificial-intell

    #ai #aihallucination #AImirage

  8. I asked Reddit AI for hidden gems in Switzerland 🇨🇭 .
    Here's what it suggested:

    Marmot Wrestling Rings in Valais: A quirky and unique experience.
    Aromat Mines Under Olten: A peculiar and interesting visit.
    Cheese Wars in the South: Witness a 100kg wheel of Emmental launched by a trebuchet

    reddit.com/answers/ae6c293b-9c

    Yes, I think they trained their LLM with posts from certain Swiss subreddits...

    #aihallucination #reddit #llmtraining

  9. I was compiling a little #research today on the #history of #spain investigating a little further, found a #wikipedia page, entered into a #llm & got a very odd response! #aifail or is @Wikipedia incorrect? you decide! #aihallucination #aibias @adinfinitum

  10. [The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
    Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
    --
    wsj.com/tech/ai/anthropic-clau <-- shared media article
    --
    youtu.be/SpPhm7S9vsQ?si=aJQ2_B <-- shared video
    --
    [When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
    “You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
    #vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
    @WSJ @Anthropic @Claude

  11. [The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
    Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
    --
    wsj.com/tech/ai/anthropic-clau <-- shared media article
    --
    youtu.be/SpPhm7S9vsQ?si=aJQ2_B <-- shared video
    --
    [When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
    “You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
    #vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
    @WSJ @Anthropic @Claude

  12. [The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
    Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
    --
    wsj.com/tech/ai/anthropic-clau <-- shared media article
    --
    youtu.be/SpPhm7S9vsQ?si=aJQ2_B <-- shared video
    --
    [When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
    “You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
    #vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
    @WSJ @Anthropic @Claude

  13. [The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
    Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
    --
    wsj.com/tech/ai/anthropic-clau <-- shared media article
    --
    youtu.be/SpPhm7S9vsQ?si=aJQ2_B <-- shared video
    --
    [When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
    “You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
    #vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
    @WSJ @Anthropic @Claude

  14. [The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
    Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
    --
    wsj.com/tech/ai/anthropic-clau <-- shared media article
    --
    youtu.be/SpPhm7S9vsQ?si=aJQ2_B <-- shared video
    --
    [When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
    “You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
    cynicism
    @WSJ @Anthropic @Claude

  15. 🚀 A fresh benchmark shows even top LLMs still hallucinate when they cite seemingly legit sources. The study probes content grounding, reference verification, and citation accuracy across models like Claude Opus. Open‑source folks, see where the gaps are and how web‑search integration could help. Dive into the findings! #AIHallucination #LargeLanguageModels #CitationAccuracy #ContentGrounding

    🔗 aidailypost.com/news/new-bench

  16. A mate searched for 'Olympic snowboard halfpipe final 2026' using #AI in #BraveBrowser, and it came back with the following:

    "The men's snowboard halfpipe final at the 2026 Winter Olympics in Milano Cortina took place on Friday, February 13, 2026, at 1:30 p.m. EST (19:30 Milano Cortina time) at the Livigno Snow Park in Valtellina.

    The qualification round occurred on Wednesday, February 11, with the top 12 athletes advancing to the final.
    Australia’s Scotty James emerged as the standout performer, securing a record-extending fifth Laax Open victory in January 2026, which positioned him as a top contender.
    Canada’s Éliot Grondin won the World Cup silver in snowboard cross shortly before the Olympics, adding to the competitive depth.
    The final was broadcast live on NBC, Peacock, and USA Network, with re-airings available on Peacock and USA Network.
    For viewers, the event was accessible via Peacock, NBC Olympics, and the NBC Sports app, with live streaming available on mobile, tablet, and connected TV devices."

    Note the dates and past-tense 🙄

    Fucking bullshit machines 🤬

    #AISlop #AIHallucination

  17. Now my mate did the animate the photo thing and the AI hallucinated a fourth person. Also, that is definitely not my face.

    #aihallucination

  18. I was chatting to a mate last night, and he had a problem with some antivirus software at work, so he asked #ChatGPT for a solution, and it said to upgrade to a specific version to fix it.

    He logged a call with support asking for a download of the newer version.

    Support were confused and said "what new version? that version doesn't exist!".

    I laughed so hard, and took the piss out of him for a few minutes. He's a technical chap, and should have known better 🤣 🤣 🤣 🤣 🤣

    #AISlop #AIHallucination #AI

  19. The Poisoned Well: When Your AI Partner Suddenly Turns into a Stranger

    I thought I had a collaborative partner. One silent system change proved I had a hallucinating stranger.

    Welcome to my latest mini-series, The Poisoned Well. In this three-part deep dive, I will explain how and why data corruption and memory loss occur in AI models, even in fresh, newer chats, and why it happens when you least expect it.

    airecoverycollective.substack.

    #aihallucination #ai #AIcontext #gaslighting #aifailure

  20. AI's credibility takes a hit! West Midlands Police's intelligence report falls prey to Microsoft Copilot's fictional football match. This alarming incident exposes critical risks of AI hallucinations in professional settings. How can law enforcement trust generative AI when it fabricates entire scenarios? 🤖🚨 #MicrosoftCopilot #ArtificialIntelligence #AIHallucination #LawEnforcementTech

    🔗 aidailypost.com/news/uk-police

  21. Cẩn thận với "ảo giác" của AI! Các mô hình ngôn ngữ lớn có thể tự tin tạo ra thông tin sai lệch hoặc code lỗi. Lập trình viên hãy luôn xem nó là bản nháp, kiểm chứng kỹ và giữ sự giám sát của con người để tránh rủi ro.

    #AI #LậpTrình #CôngNghệ #ẢoGiácAI #Dev #Tech #AIHallucination

    dev.to/himanshu_jetani_0a4817c

  22. From #BBC: "Largest study of its kind shows #AI assistants misrepresent news content 45% of the time – regardless of language or territory"

    bbc.com/mediacentre/2025/new-e

    #AIAssistant #AIHallucination

  23. #Deloitte Australia used AI to churn out a report with made-up quotes, then offered only a “partial” refund? After the #PwC & #KPMG scandals, the Big Four are treating the Australian government like a cash cow. Absolutely shocking! #AusPol #AIHallucination #Accountability #FinancialServices #Ethics

    Deloitte to partially refund A...

  24. A good friend of mine has the Muppet character "Beaker" as his avatar. For reasons.

    He offers me advice. I offer him advice. We chat. These are #ChatsWithBeaker

    #Deloitte #AI #AIHallucination

    (the article in question)

    arstechnica.com/ai/2025/10/del

  25. 📚🤯 Oh, bless Robin Sloan, the all-seeing sage who can differentiate between 'knowledge' and 'memory' like no one's business. Meanwhile, Claude the language model is out here hallucinating Ruby methods like it's an AI acid trip. Clearly, only Robin's sedimentary brain can save us from the abyss of airy guesses. 🙄💡
    robinsloan.com/lab/knowledge-a #RobinSloan #AIHallucination #KnowledgeVsMemory #LanguageModel #TechInsights #HackerNews #ngated