#aihallucination — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #aihallucination, aggregated by home.social.
-
Ah, yet another groundbreaking revelation: AI models "hallucinate"—as if we didn't already know they embellish like a caffeinated novelist. 🤪 Who would've guessed that predicting the future involves a little guesswork? Maybe the next paper will enlighten us on water being wet. 🧐
https://arxiv.org/abs/2401.11817 #AIhallucination #AImodels #TechHumor #PredictingTheFuture #GroundbreakingRevelation #HackerNews #ngated -
Ah, yet another groundbreaking revelation: AI models "hallucinate"—as if we didn't already know they embellish like a caffeinated novelist. 🤪 Who would've guessed that predicting the future involves a little guesswork? Maybe the next paper will enlighten us on water being wet. 🧐
https://arxiv.org/abs/2401.11817 #AIhallucination #AImodels #TechHumor #PredictingTheFuture #GroundbreakingRevelation #HackerNews #ngated -
Ah, yet another groundbreaking revelation: AI models "hallucinate"—as if we didn't already know they embellish like a caffeinated novelist. 🤪 Who would've guessed that predicting the future involves a little guesswork? Maybe the next paper will enlighten us on water being wet. 🧐
https://arxiv.org/abs/2401.11817 #AIhallucination #AImodels #TechHumor #PredictingTheFuture #GroundbreakingRevelation #HackerNews #ngated -
Ah, yet another groundbreaking revelation: AI models "hallucinate"—as if we didn't already know they embellish like a caffeinated novelist. 🤪 Who would've guessed that predicting the future involves a little guesswork? Maybe the next paper will enlighten us on water being wet. 🧐
https://arxiv.org/abs/2401.11817 #AIhallucination #AImodels #TechHumor #PredictingTheFuture #GroundbreakingRevelation #HackerNews #ngated -
Ah, yet another groundbreaking revelation: AI models "hallucinate"—as if we didn't already know they embellish like a caffeinated novelist. 🤪 Who would've guessed that predicting the future involves a little guesswork? Maybe the next paper will enlighten us on water being wet. 🧐
https://arxiv.org/abs/2401.11817 #AIhallucination #AImodels #TechHumor #PredictingTheFuture #GroundbreakingRevelation #HackerNews #ngated -
Elon shared that Grok 4.20 hits 83% on "non-hallucination" vs Claude's ~74%. Hallucination = when AI confidently makes up facts. Like saying Paris is Italy's capital or inventing fake research studies. 83% accuracy means it's still wrong 1 in 5 times. Better than most AIs, but you can't trust it blindly for important decisions yet. #AI #Grok #AIHallucination
-
...This situation goes a step above mere AI hallucinations into what the researchers call an AI “mirage.” Unlike the generative AI errors we’ve come to expect, the AI mirage is incredibly rational from start to finish...
https://futurism.com/artificial-intelligence/hospital-ceo-ai-radiology
-
I asked Reddit AI for hidden gems in Switzerland 🇨🇭 .
Here's what it suggested:Marmot Wrestling Rings in Valais: A quirky and unique experience.
Aromat Mines Under Olten: A peculiar and interesting visit.
Cheese Wars in the South: Witness a 100kg wheel of Emmental launched by a trebuchetYes, I think they trained their LLM with posts from certain Swiss subreddits...
-
https://winbuzzer.com/2026/03/25/xai-grok-420-honesty-record-intelligence-gap-xcxwbn/
xAI's Grok 4.20 AI Model Sets Honesty Record but Trails in Intelligence
#AI #xAI #Grok #Grok42 #GenerativeAI #AIModels #AIBenchmarks #AIHallucination #AIReasoningModels #ElonMusk
-
https://winbuzzer.com/2026/03/25/xai-grok-420-honesty-record-intelligence-gap-xcxwbn/
xAI's Grok 4.20 AI Model Sets Honesty Record but Trails in Intelligence
#AI #xAI #Grok #Grok42 #GenerativeAI #AIModels #AIBenchmarks #AIHallucination #AIReasoningModels #ElonMusk
-
https://winbuzzer.com/2026/03/25/xai-grok-420-honesty-record-intelligence-gap-xcxwbn/
xAI's Grok 4.20 AI Model Sets Honesty Record but Trails in Intelligence
#AI #xAI #Grok #Grok42 #GenerativeAI #AIModels #AIBenchmarks #AIHallucination #AIReasoningModels #ElonMusk
-
https://winbuzzer.com/2026/03/25/xai-grok-420-honesty-record-intelligence-gap-xcxwbn/
xAI's Grok 4.20 AI Model Sets Honesty Record but Trails in Intelligence
#AI #xAI #Grok #Grok42 #GenerativeAI #AIModels #AIBenchmarks #AIHallucination #AIReasoningModels #ElonMusk
-
https://winbuzzer.com/2026/03/25/xai-grok-420-honesty-record-intelligence-gap-xcxwbn/
xAI's Grok 4.20 AI Model Sets Honesty Record but Trails in Intelligence
#AI #xAI #Grok #Grok42 #GenerativeAI #AIModels #AIBenchmarks #AIHallucination #AIReasoningModels #ElonMusk
-
I was compiling a little #research today on the #history of #spain investigating a little further, found a #wikipedia page, entered into a #llm & got a very odd response! #aifail or is @Wikipedia incorrect? you decide! #aihallucination #aibias @adinfinitum
-
[The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
--
https://www.wsj.com/tech/ai/anthropic-claude-ai-vending-machine-agent-b7e84e34?gaa_at=eafs <-- shared media article
--
https://youtu.be/SpPhm7S9vsQ?si=aJQ2_BoxvLcNjOiz <-- shared video
--
[When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
“You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
#vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
@WSJ @Anthropic @Claude -
[The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
--
https://www.wsj.com/tech/ai/anthropic-claude-ai-vending-machine-agent-b7e84e34?gaa_at=eafs <-- shared media article
--
https://youtu.be/SpPhm7S9vsQ?si=aJQ2_BoxvLcNjOiz <-- shared video
--
[When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
“You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
#vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
@WSJ @Anthropic @Claude -
[The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
--
https://www.wsj.com/tech/ai/anthropic-claude-ai-vending-machine-agent-b7e84e34?gaa_at=eafs <-- shared media article
--
https://youtu.be/SpPhm7S9vsQ?si=aJQ2_BoxvLcNjOiz <-- shared video
--
[When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
“You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
#vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
@WSJ @Anthropic @Claude -
[The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
--
https://www.wsj.com/tech/ai/anthropic-claude-ai-vending-machine-agent-b7e84e34?gaa_at=eafs <-- shared media article
--
https://youtu.be/SpPhm7S9vsQ?si=aJQ2_BoxvLcNjOiz <-- shared video
--
[When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
“You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
#vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
@WSJ @Anthropic @Claude -
[The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
--
https://www.wsj.com/tech/ai/anthropic-claude-ai-vending-machine-agent-b7e84e34?gaa_at=eafs <-- shared media article
--
https://youtu.be/SpPhm7S9vsQ?si=aJQ2_BoxvLcNjOiz <-- shared video
--
[When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
“You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
#vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
@WSJ @Anthropic @Claude -
🚀 A fresh benchmark shows even top LLMs still hallucinate when they cite seemingly legit sources. The study probes content grounding, reference verification, and citation accuracy across models like Claude Opus. Open‑source folks, see where the gaps are and how web‑search integration could help. Dive into the findings! #AIHallucination #LargeLanguageModels #CitationAccuracy #ContentGrounding
🔗 https://aidailypost.com/news/new-benchmark-finds-ai-still-hallucinates-despite-citing-legitimate
-
A mate searched for 'Olympic snowboard halfpipe final 2026' using #AI in #BraveBrowser, and it came back with the following:
"The men's snowboard halfpipe final at the 2026 Winter Olympics in Milano Cortina took place on Friday, February 13, 2026, at 1:30 p.m. EST (19:30 Milano Cortina time) at the Livigno Snow Park in Valtellina.
The qualification round occurred on Wednesday, February 11, with the top 12 athletes advancing to the final.
Australia’s Scotty James emerged as the standout performer, securing a record-extending fifth Laax Open victory in January 2026, which positioned him as a top contender.
Canada’s Éliot Grondin won the World Cup silver in snowboard cross shortly before the Olympics, adding to the competitive depth.
The final was broadcast live on NBC, Peacock, and USA Network, with re-airings available on Peacock and USA Network.
For viewers, the event was accessible via Peacock, NBC Olympics, and the NBC Sports app, with live streaming available on mobile, tablet, and connected TV devices."Note the dates and past-tense 🙄
Fucking bullshit machines 🤬
-
Now my mate did the animate the photo thing and the AI hallucinated a fourth person. Also, that is definitely not my face.
-
I was chatting to a mate last night, and he had a problem with some antivirus software at work, so he asked #ChatGPT for a solution, and it said to upgrade to a specific version to fix it.
He logged a call with support asking for a download of the newer version.
Support were confused and said "what new version? that version doesn't exist!".
I laughed so hard, and took the piss out of him for a few minutes. He's a technical chap, and should have known better 🤣 🤣 🤣 🤣 🤣
-
https://www.europesays.com/uk/725414/ Free AI training to be offered to every adult in the UK #AIHallucination #AstonVilla #Britain #DSIT #England #GreatBritain #InnovationAndTechnology #LizKendall #NorthernIreland #Scotland #TheDepartmentForScience #UK #UnitedKingdom #Wales
-
The Poisoned Well: When Your AI Partner Suddenly Turns into a Stranger
I thought I had a collaborative partner. One silent system change proved I had a hallucinating stranger.
Welcome to my latest mini-series, The Poisoned Well. In this three-part deep dive, I will explain how and why data corruption and memory loss occur in AI models, even in fresh, newer chats, and why it happens when you least expect it.
https://airecoverycollective.substack.com/p/the-poisoned-well-when-your-ai-partner
-
West Midlands police chief quits over AI hallucination
https://www.theregister.com/2026/01/19/copper_chief_cops_it_after/
#HackerNews #WestMidlandsPolice #AIhallucination #PoliceChiefResignation #TechNews #CurrentEvents
-
AI's credibility takes a hit! West Midlands Police's intelligence report falls prey to Microsoft Copilot's fictional football match. This alarming incident exposes critical risks of AI hallucinations in professional settings. How can law enforcement trust generative AI when it fabricates entire scenarios? 🤖🚨 #MicrosoftCopilot #ArtificialIntelligence #AIHallucination #LawEnforcementTech
🔗 https://aidailypost.com/news/uk-police-cite-microsoft-copilots-fake-football-match-intelligence
-
xAI Launches Grok 4.1, Targeting Emotional Intelligence and Reliability to Top AI Benchmarks
#AI #xAI #Grok #ElonMusk #Grok41 #AIBenchmarks #LMArena #MachineLearning #AIethics #AIhallucination
-
Chinese Hackers Weaponize Claude AI to Execute First Autonomous Cyber Espionage Campaign at Scale https://thecyberexpress.com/1st-autonomous-cyber-espionage-with-claude-ai/ #CyberattackLifecycle #VulnerabilityNews #AutonomousAttack #AIHallucination #ClaudeAIHacking #Chinesehackers #FirewallDaily #VibeHacking #ClaudeCode #CyberNews #Anthropic #Research #Claude
-
From #BBC: "Largest study of its kind shows #AI assistants misrepresent news content 45% of the time – regardless of language or territory"
https://www.bbc.com/mediacentre/2025/new-ebu-research-ai-assistants-news-content
-
#Deloitte Australia used AI to churn out a report with made-up quotes, then offered only a “partial” refund? After the #PwC & #KPMG scandals, the Big Four are treating the Australian government like a cash cow. Absolutely shocking! #AusPol #AIHallucination #Accountability #FinancialServices #Ethics
Deloitte to partially refund A... -
A good friend of mine has the Muppet character "Beaker" as his avatar. For reasons.
He offers me advice. I offer him advice. We chat. These are #ChatsWithBeaker
#Deloitte #AI #AIHallucination
(the article in question)
-
📚🤯 Oh, bless Robin Sloan, the all-seeing sage who can differentiate between 'knowledge' and 'memory' like no one's business. Meanwhile, Claude the language model is out here hallucinating Ruby methods like it's an AI acid trip. Clearly, only Robin's sedimentary brain can save us from the abyss of airy guesses. 🙄💡
https://www.robinsloan.com/lab/knowledge-and-memory/ #RobinSloan #AIHallucination #KnowledgeVsMemory #LanguageModel #TechInsights #HackerNews #ngated -
The personhood trap: How AI fakes human personality - Recently, a woman slowed down a line at the post office, wav... - https://arstechnica.com/information-technology/2025/08/the-personhood-trap-how-ai-fakes-human-personality/ #largelanguagemodels #promptengineering #aiconsciousness #aihallucination #machinelearning #aiassistants #aipersonhood #aisycophancy #generativeai #aipsychosis #elizaeffect #aibehavior #aichatbots #anthropic #microsoft #features #aiethics #chatbots #elonmusk #biz&it
-
After teen suicide, OpenAI claims it is “helping people when they need it most” - OpenAI published a blog post on Tuesday titled "Helping peop... - https://arstechnica.com/information-technology/2025/08/after-teen-suicide-openai-claims-it-is-helping-people-when-they-need-it-most/ #attentionmechanism #crisisintervention #aiandmentalhealth #contentmoderation #suicideprevention #transformermodels #aihallucination #machinelearning #aipaternalism #aiassistants #airegulation #aisafeguards #ai
-
With AI chatbots, Big Tech is moving fast and breaking people - Allan Brooks, a 47-year-old corporate recruiter, spent three... - https://arstechnica.com/information-technology/2025/08/with-ai-chatbots-big-tech-is-moving-fast-and-breaking-people/ #largelanguagemodels #chatgptpsychosis #aihallucination #machinelearning #aipaternalism #mentalillness #aiassistants #airegulation #aisycophancy #generativeai #mentalhealth #aialignment #aicriticism #aipsychosis #emotionalai #aibehavior #features
-
Two major AI coding tools wiped out user data after making cascading mistakes - New types of AI coding assistants promise to let anyone buil... - https://arstechnica.com/information-technology/2025/07/ai-coding-assistants-chase-phantoms-destroy-real-user-data/ #largelanguagemodels #aidevelopmenttools #aiconfabulation #aihallucination #machinelearning #confabulations #aidevelopment #aiassistants #generativeai #multimodalai #datascience #jasonlemkin #programming #aibehavior #aifailures #ai
-
Two major AI coding tools wiped out user data after making cascading mistakes - New types of AI coding assistants promise to let anyone buil... - https://arstechnica.com/information-technology/2025/07/ai-coding-assistants-chase-phantoms-destroy-real-user-data/ #largelanguagemodels #aidevelopmenttools #aiconfabulation #aihallucination #machinelearning #confabulations #aidevelopment #aiassistants #generativeai #multimodalai #datascience #jasonlemkin #programming #aibehavior #aifailures #ai
-
Two major AI coding tools wiped out user data after making cascading mistakes - New types of AI coding assistants promise to let anyone buil... - https://arstechnica.com/information-technology/2025/07/ai-coding-assistants-chase-phantoms-destroy-real-user-data/ #largelanguagemodels #aidevelopmenttools #aiconfabulation #aihallucination #machinelearning #confabulations #aidevelopment #aiassistants #generativeai #multimodalai #datascience #jasonlemkin #programming #aibehavior #aifailures #ai
-
Two major AI coding tools wiped out user data after making cascading mistakes - New types of AI coding assistants promise to let anyone buil... - https://arstechnica.com/information-technology/2025/07/ai-coding-assistants-chase-phantoms-destroy-real-user-data/ #largelanguagemodels #aidevelopmenttools #aiconfabulation #aihallucination #machinelearning #confabulations #aidevelopment #aiassistants #generativeai #multimodalai #datascience #jasonlemkin #programming #aibehavior #aifailures #ai
-
Two major AI coding tools wiped out user data after making cascading mistakes - New types of AI coding assistants promise to let anyone buil... - https://arstechnica.com/information-technology/2025/07/ai-coding-assistants-chase-phantoms-destroy-real-user-data/ #largelanguagemodels #aidevelopmenttools #aiconfabulation #aihallucination #machinelearning #confabulations #aidevelopment #aiassistants #generativeai #multimodalai #datascience #jasonlemkin #programming #aibehavior #aifailures #ai
-
ChatGPT made up a product feature out of thin air, so this company created it - On Monday, sheet music platform Soundslice says it developed... - https://arstechnica.com/ai/2025/07/chatgpt-made-up-a-product-feature-out-of-thin-air-so-this-company-created-it/ #largelanguagemodels #productdevelopment #aimisinformation #aiconfabulation #aihallucination #machinelearning #webdevelopment #generativeai #soundslice #chatgpt #biz #openai #music #ai
-
ChatGPT made up a product feature out of thin air, so this company created it - On Monday, sheet music platform Soundslice says it developed... - https://arstechnica.com/ai/2025/07/chatgpt-made-up-a-product-feature-out-of-thin-air-so-this-company-created-it/ #largelanguagemodels #productdevelopment #aimisinformation #aiconfabulation #aihallucination #machinelearning #webdevelopment #generativeai #soundslice #chatgpt #biz #openai #music #ai
-
ChatGPT made up a product feature out of thin air, so this company created it - On Monday, sheet music platform Soundslice says it developed... - https://arstechnica.com/ai/2025/07/chatgpt-made-up-a-product-feature-out-of-thin-air-so-this-company-created-it/ #largelanguagemodels #productdevelopment #aimisinformation #aiconfabulation #aihallucination #machinelearning #webdevelopment #generativeai #soundslice #chatgpt #biz #openai #music #ai
-
ChatGPT made up a product feature out of thin air, so this company created it - On Monday, sheet music platform Soundslice says it developed... - https://arstechnica.com/ai/2025/07/chatgpt-made-up-a-product-feature-out-of-thin-air-so-this-company-created-it/ #largelanguagemodels #productdevelopment #aimisinformation #aiconfabulation #aihallucination #machinelearning #webdevelopment #generativeai #soundslice #chatgpt #biz #openai #music #ai
-
ChatGPT made up a product feature out of thin air, so this company created it - On Monday, sheet music platform Soundslice says it developed... - https://arstechnica.com/ai/2025/07/chatgpt-made-up-a-product-feature-out-of-thin-air-so-this-company-created-it/ #largelanguagemodels #productdevelopment #aimisinformation #aiconfabulation #aihallucination #machinelearning #webdevelopment #generativeai #soundslice #chatgpt #biz #openai #music #ai