#reasoningmodels — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #reasoningmodels, aggregated by home.social.
-
OpenAI claims it solved an 80-year-old math problem — for real this time
OpenAI claims its new reasoning model has produced an original mathematical proof disproving a famous unsolved conjecture in…
#NewsBeep #News #US #USA #UnitedStates #UnitedStatesOfAmerica #Artificialintelligence #AI #ArtificialIntelligence #ChatGPT #erdosproblems #OpenAI #reasoningmodels #Technology
https://www.newsbeep.com/us/655490/ -
OpenAI claims it solved an 80-year-old math problem — for real this time
OpenAI claims its new reasoning model has produced an original mathematical proof disproving a famous unsolved conjecture in…
#NewsBeep #News #US #USA #UnitedStates #UnitedStatesOfAmerica #Artificialintelligence #AI #ArtificialIntelligence #ChatGPT #erdosproblems #OpenAI #reasoningmodels #Technology
https://www.newsbeep.com/us/655490/ -
Arcee AI released Trinity-Large-Thinking, a 400B parameter open-source reasoning model that scores within 2 points of Claude Opus on PinchBench while costing 96% less at $0.90 per million tokens. Uses sparse architecture activating only 13B parameters per token. Trained for $20M by 30-person team. #OpenSource #AI #ReasoningModels
-
xAI’s co‑founder exits keep coming, while Lambda outlines a 2025 shift toward bigger context windows, multimodal reasoning models and open‑source inference for AI production. What could this mean for the future of machine learning? Read on for the full story. #AIProduction #ReasoningModels #MultimodalAI #OpenSourceInference
🔗 https://aidailypost.com/news/xai-co-founder-departures-persist-lambda-outlines-2025-ai-production
-
AI that thinks instead of guessing?
Reasoning models use techniques like chain of thought and tree of thought to decompose problems, explore alternatives, and choose better answers, often at the cost of more compute and latency.
A practical explainer:
🔗 https://techglimmer.io/what-is-ai-thinking-reasoning-models/#AI #ReasoningModels #ChainOfThought #TreeOfThought #GenAI #FediTech #MachineLearning
-
2025 saw significant advancements in #LLMs, particularly in the areas of #reasoning and #agent based systems. #Reasoningmodels, capable of breaking down #complextasks and utilising tools, revolutionised #coding and #search. The year witnessed the rise of #codingagents, exemplified by #ClaudeCode, which can autonomously write, execute, and refine code. https://simonwillison.net/2025/Dec/31/the-year-in-llms/?eicker.news #tech #media #news
-
2025 saw significant advancements in #LLMs, particularly in the areas of #reasoning and #agent based systems. #Reasoningmodels, capable of breaking down #complextasks and utilising tools, revolutionised #coding and #search. The year witnessed the rise of #codingagents, exemplified by #ClaudeCode, which can autonomously write, execute, and refine code. https://simonwillison.net/2025/Dec/31/the-year-in-llms/?eicker.news #tech #media #news
-
2025 saw significant advancements in #LLMs, particularly in the areas of #reasoning and #agent based systems. #Reasoningmodels, capable of breaking down #complextasks and utilising tools, revolutionised #coding and #search. The year witnessed the rise of #codingagents, exemplified by #ClaudeCode, which can autonomously write, execute, and refine code. https://simonwillison.net/2025/Dec/31/the-year-in-llms/?eicker.news #tech #media #news
-
2025 saw significant advancements in #LLMs, particularly in the areas of #reasoning and #agent based systems. #Reasoningmodels, capable of breaking down #complextasks and utilising tools, revolutionised #coding and #search. The year witnessed the rise of #codingagents, exemplified by #ClaudeCode, which can autonomously write, execute, and refine code. https://simonwillison.net/2025/Dec/31/the-year-in-llms/?eicker.news #tech #media #news
-
2025 saw significant advancements in #LLMs, particularly in the areas of #reasoning and #agent based systems. #Reasoningmodels, capable of breaking down #complextasks and utilising tools, revolutionised #coding and #search. The year witnessed the rise of #codingagents, exemplified by #ClaudeCode, which can autonomously write, execute, and refine code. https://simonwillison.net/2025/Dec/31/the-year-in-llms/?eicker.news #tech #media #news
-
OpenAI: GPT-5 Thinking Models Are The Most "Monitarable" Models To Date
#AI #OpenAI #AISafety #LLM #MachineLearning #GPT5 #DeepMind #AIResearch #ChainOfThought #Monitorability #AIAlignment #ReasoningModels
-
OpenAI: GPT-5 Thinking Models Are The Most "Monitarable" Models To Date
#AI #OpenAI #AISafety #LLM #MachineLearning #GPT5 #DeepMind #AIResearch #ChainOfThought #Monitorability #AIAlignment #ReasoningModels
-
OpenAI: GPT-5 Thinking Models Are The Most "Monitarable" Models To Date
#AI #OpenAI #AISafety #LLM #MachineLearning #GPT5 #DeepMind #AIResearch #ChainOfThought #Monitorability #AIAlignment #ReasoningModels
-
OpenAI: GPT-5 Thinking Models Are The Most "Monitarable" Models To Date
#AI #OpenAI #AISafety #LLM #MachineLearning #GPT5 #DeepMind #AIResearch #ChainOfThought #Monitorability #AIAlignment #ReasoningModels
-
OpenAI: GPT-5 Thinking Models Are The Most "Monitarable" Models To Date
#AI #OpenAI #AISafety #LLM #MachineLearning #GPT5 #DeepMind #AIResearch #ChainOfThought #Monitorability #AIAlignment #ReasoningModels
-
Reasoning Models Reason Well, Until They Don't
https://arxiv.org/abs/2510.22371
#HackerNews #ReasoningModels #ReasonWell #AIResearch #MachineLearning #HackerNews
-
"The point is that with each advance in AI, new hurdles become apparent; when one missing aspect of “intelligence” is filled in, we find ourselves bumping up against another gap. When I speculated about GPT-5 last year, it didn’t occur to me to question whether it would know how to set priorities, because the models of the time weren’t even capable enough for that to be a limiting factor. In a post from November, AI is Racing Forward – on a Very Long Road, I wrote:
…the real challenges may be things that we can’t easily anticipate right now, weaknesses that we will only start to put our finger on when we observe [future models] performing astonishing feats and yet somehow still not being able to write that tightly-plotted novel.
In April 2024, it seemed like agentic AI was going to be the next big thing. The ensuing 16 months have brought enormous progress on many fronts, but very little progress on real-world agency. With projects like AI Village shining a light on the profound weakness of current AI agents, I think robust real-world capability is still years away."
https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent
#AI #GenerativeAI #LLMs #Chatbots #AIAgents #AgenticAI #ReasoningModels
-
🧠 What if you could tell AI how much to think before answering?
Seed-OSS 36B gives builders a thinking budget knob + 512K context window—control depth vs speed like never before. ⚡👉 See how it changes product SLAs, costs, and user experience:
https://medium.com/@rogt.x1997/seed-oss-36b-a-tweakable-reasoning-engine-for-long-context-work-66aa05a72548#AI #ReasoningModels #LongContext
https://medium.com/@rogt.x1997/seed-oss-36b-a-tweakable-reasoning-engine-for-long-context-work-66aa05a72548 -
Seven so-called "replies" to Apple's paper on reasoning models, or as I like to call them, seven exercises in missing the point entirely. 📚🤦♂️ It's almost like a bad magic trick: look over here at these rebuttals while we pretend the original issue just vanishes! 🎩✨
https://garymarcus.substack.com/p/seven-replies-to-the-viral-apple #AppleReplies #ReasoningModels #MissingThePoint #BadMagicTrick #TechCritique #HackerNews #ngated -
OpenAI Releases new o3-Pro AI Model: A High-Stakes Bet on AI Reliability
#AI #OpenAI #o3pro #LLM #TechNews #ReasoningModels #ChatGPT #EnterpriseAI #AIEthics
-
The Illusion of Thinking: Strengths and Limitations of Reasoning Models
https://machinelearning.apple.com/research/illusion-of-thinking
#HackerNews #IllusionOfThinking #ReasoningModels #StrengthsAndLimitations #AIResearch #MachineLearning
-
No more guessing games! 🕵️♂️ #ollama's new 'think' feature cleanly separates the model's internal thinking from the content. Easy to enable - just 'think': true in your API request. #AIdevelopment #ReasoningModels https://youtu.be/yBD598s5g8c
-
DeepSeek R1 AI Model Update Boosts Reasoning, Catching up With OpenAI o3 and Gemini 2.5 Pro
#AI #DeepSeek #GenAI #LLM #DeepSeekR1 #AIUpdate #OpenSourceAI #ReasoningModels #AIBenchmarks #MachineLearning #ChinaAI #China
-
Microsoft Debuts Phi-4 Reasoning Models, Aiming for Big Performance Gains
#Microsoft #AI #Phi4 #SLM #LLM #OpenSourceAI #ReasoningModels #GenAI #MachineLearning
-
🤖 AI
🔴 OpenAI Unveils o3 & o4-mini Reasoning Models🔸 o3 outperforms all models in math, coding & visual tasks; o4-mini balances price & power.
🔸 First OpenAI models to "think with images" — can analyze blurry PDFs or sketches.
🔸 Both run Python, browse the web, and will be accessible via APIs & ChatGPT. -
Microsoft Adds OpenAI o3, o4-mini to Azure & GitHub
#AI #OpenAI #Microsoft #Azure #GitHub #o3 #o4mini #LLMa #ReasoningModels #CloudComputing
https://winbuzzer.com/2025/04/17/microsoft-adds-openai-o3-o4-mini-to-azure-github-xcxwbn/
-
OpenAI is set to launch GPT-4.1 and reasoning models o3/o4-mini soon, reversing earlier plans and delaying GPT-5 amidst capacity issues
#OpenAI #GPT4 #GPT4o #GPT4_1 #AI #GenAI #LLMs #ReasoningModels #o3 #o4mini #AIModels #ChatGPT #SamAltman
https://winbuzzer.com/2025/04/10/openai-readies-gpt-4-1-o3-o4-mini-launch-expected-next-week-xcxwbn/
-
Oh, the irony! An article on "reasoning models" that can't reason its way past a #JavaScript prompt. 🤖🧠✨ Maybe it should model how to enable #cookies first before philosophizing! 🍪🙄
https://www.anthropic.com/research/reasoning-models-dont-say-think #irony #reasoningmodels #techhumor #codingstruggles #HackerNews #ngated -
Reasoning models don't always say what they think
https://www.anthropic.com/research/reasoning-models-dont-say-think
#HackerNews #ReasoningModels #AIResearch #CognitiveScience #MachineLearning #TechInsights
-
Apparently AI reasoning models like Deepseek-R1 and OpenAI o1 suffer from "underthinking", where they abandon promising solutions too quickly, leading to inefficient resource use. To address this, a "thought switching penalty" (TIP) was developed, which improved accuracy across math and science problems.
-
O3-mini is now available to all ChatGPT users, giving free users their first chance to try OpenAI's reasoning models! 🧠🚀 #ChatGPT #OpenAI #AI #ReasoningModels #TechNews #ArtificialIntelligence #MachineLearning #AICommunity #FreeAccess
-
Im #Newsletter habe ich ein paar Gedanken und... Thesen? Beobachtungen? zu #DeepSeek aufgeschrieben. https://internetobservatorium.substack.com/p/aus-dem-internet-observatorium-123 #AI #KI #KünstlicheIntelligenz #ReasoningModels #ChinaTech
-
The Chinese firm said training the model cost just $5.6 million. Alibaba Cloud followed with a new generative AI model, while Microsoft alleges DeepSeek ‘distilled’ OpenAI’s work.#artificialintelligence #chatgpt #deepseek #deepseekr1 #deepseek-v3 #generativeai #Microsoft #nvidia #openai #reasoningmodels
DeepSeek Chatbot Beats OpenAI on App Store Leaderboard -
»#OpenAI trained #o1 and #o3 to 'think' about its #safetypolicy: outlining the company’s latest way to ensure #AI #reasoningmodels stay aligned with the #values of their #humandevelopers.« https://techcrunch.com/2024/12/22/openai-trained-o1-and-o3-to-think-about-its-safety-policy/?eicker.news #tech #media
-
»#OpenAI trained #o1 and #o3 to 'think' about its #safetypolicy: outlining the company’s latest way to ensure #AI #reasoningmodels stay aligned with the #values of their #humandevelopers.« https://techcrunch.com/2024/12/22/openai-trained-o1-and-o3-to-think-about-its-safety-policy/?eicker.news #tech #media
-
»#OpenAI trained #o1 and #o3 to 'think' about its #safetypolicy: outlining the company’s latest way to ensure #AI #reasoningmodels stay aligned with the #values of their #humandevelopers.« https://techcrunch.com/2024/12/22/openai-trained-o1-and-o3-to-think-about-its-safety-policy/?eicker.news #tech #media
-
»#OpenAI trained #o1 and #o3 to 'think' about its #safetypolicy: outlining the company’s latest way to ensure #AI #reasoningmodels stay aligned with the #values of their #humandevelopers.« https://techcrunch.com/2024/12/22/openai-trained-o1-and-o3-to-think-about-its-safety-policy/?eicker.news #tech #media
-
»#OpenAI trained #o1 and #o3 to 'think' about its #safetypolicy: outlining the company’s latest way to ensure #AI #reasoningmodels stay aligned with the #values of their #humandevelopers.« https://techcrunch.com/2024/12/22/openai-trained-o1-and-o3-to-think-about-its-safety-policy/?eicker.news #tech #media
-
OpenAI's o1 marks a major shift in the AI industry, moving away from prediction-based LLMs to reasoning models that aim to overcome their limitations. 🔍🤖 #OpenAI #AI #MachineLearning #ReasoningModels #ArtificialIntelligence #TechInnovation #AIShift