#reasoningmodels — Public Fediverse posts on home.social

United States News Beep @[email protected] · 2026-05-21 · 02:40 UTC

OpenAI claims it solved an 80-year-old math problem — for real this time

OpenAI claims its new reasoning model has produced an original mathematical proof disproving a famous unsolved conjecture in…
#NewsBeep #News #US #USA #UnitedStates #UnitedStatesOfAmerica #Artificialintelligence #AI #ArtificialIntelligence #ChatGPT #erdosproblems #OpenAI #reasoningmodels #Technology
https://www.newsbeep.com/us/655490/

#newsbeep #news #us #usa #unitedstates #unitedstatesofamerica

United States News Beep @[email protected] · 2026-05-21 · 02:40 UTC

OpenAI claims it solved an 80-year-old math problem — for real this time

OpenAI claims its new reasoning model has produced an original mathematical proof disproving a famous unsolved conjecture in…
#NewsBeep #News #US #USA #UnitedStates #UnitedStatesOfAmerica #Artificialintelligence #AI #ArtificialIntelligence #ChatGPT #erdosproblems #OpenAI #reasoningmodels #Technology
https://www.newsbeep.com/us/655490/

#technology #reasoningmodels #openai #erdosproblems #chatgpt #ai

Marcus Schuler @[email protected] · 2026-04-03 · 07:02 UTC

Arcee AI released Trinity-Large-Thinking, a 400B parameter open-source reasoning model that scores within 2 points of Claude Opus on PinchBench while costing 96% less at $0.90 per million tokens. Uses sparse architecture activating only 13B parameters per token. Trained for $20M by 30-person team. #OpenSource #AI #ReasoningModels

https://www.implicator.ai/arcee-ai-releases-400b-open-reasoning-model-that-rivals-claude-at-96-lower-cost/

#opensource #ai #reasoningmodels

AI Daily Post @[email protected] · 2026-02-11 · 11:27 UTC

xAI’s co‑founder exits keep coming, while Lambda outlines a 2025 shift toward bigger context windows, multimodal reasoning models and open‑source inference for AI production. What could this mean for the future of machine learning? Read on for the full story. #AIProduction #ReasoningModels #MultimodalAI #OpenSourceInference

🔗 https://aidailypost.com/news/xai-co-founder-departures-persist-lambda-outlines-2025-ai-production

#aiproduction #reasoningmodels #multimodalai #opensourceinference

TechGlimmer @[email protected] · 2026-01-21 · 23:18 UTC

AI that thinks instead of guessing?

Reasoning models use techniques like chain of thought and tree of thought to decompose problems, explore alternatives, and choose better answers, often at the cost of more compute and latency.

A practical explainer:
🔗 https://techglimmer.io/what-is-ai-thinking-reasoning-models/

#AI #ReasoningModels #ChainOfThought #TreeOfThought #GenAI #FediTech #MachineLearning

#ai #reasoningmodels #chainofthought #treeofthought #genai #feditech

tech news ᳇ eicker.news @[email protected] · 2026-01-01 · 10:29 UTC

2025 saw significant advancements in #LLMs, particularly in the areas of #reasoning and #agent based systems. #Reasoningmodels, capable of breaking down #complextasks and utilising tools, revolutionised #coding and #search. The year witnessed the rise of #codingagents, exemplified by #ClaudeCode, which can autonomously write, execute, and refine code. https://simonwillison.net/2025/Dec/31/the-year-in-llms/?eicker.news #tech #media #news

#llms #reasoning #agent #reasoningmodels #complextasks #coding

tech news ᳇ eicker.news @[email protected] · 2026-01-01 · 10:29 UTC

2025 saw significant advancements in #LLMs, particularly in the areas of #reasoning and #agent based systems. #Reasoningmodels, capable of breaking down #complextasks and utilising tools, revolutionised #coding and #search. The year witnessed the rise of #codingagents, exemplified by #ClaudeCode, which can autonomously write, execute, and refine code. https://simonwillison.net/2025/Dec/31/the-year-in-llms/?eicker.news #tech #media #news

#llms #reasoning #agent #reasoningmodels #complextasks #coding

tech news ᳇ eicker.news @[email protected] · 2026-01-01 · 10:29 UTC

2025 saw significant advancements in #LLMs, particularly in the areas of #reasoning and #agent based systems. #Reasoningmodels, capable of breaking down #complextasks and utilising tools, revolutionised #coding and #search. The year witnessed the rise of #codingagents, exemplified by #ClaudeCode, which can autonomously write, execute, and refine code. https://simonwillison.net/2025/Dec/31/the-year-in-llms/?eicker.news #tech #media #news

#llms #reasoning #agent #reasoningmodels #complextasks #coding

tech news ᳇ eicker.news @[email protected] · 2026-01-01 · 10:29 UTC

2025 saw significant advancements in #LLMs, particularly in the areas of #reasoning and #agent based systems. #Reasoningmodels, capable of breaking down #complextasks and utilising tools, revolutionised #coding and #search. The year witnessed the rise of #codingagents, exemplified by #ClaudeCode, which can autonomously write, execute, and refine code. https://simonwillison.net/2025/Dec/31/the-year-in-llms/?eicker.news #tech #media #news

#news #media #tech #claudecode #codingagents #search

tech news ᳇ eicker.news @[email protected] · 2026-01-01 · 10:29 UTC

2025 saw significant advancements in #LLMs, particularly in the areas of #reasoning and #agent based systems. #Reasoningmodels, capable of breaking down #complextasks and utilising tools, revolutionised #coding and #search. The year witnessed the rise of #codingagents, exemplified by #ClaudeCode, which can autonomously write, execute, and refine code. https://simonwillison.net/2025/Dec/31/the-year-in-llms/?eicker.news #tech #media #news

#llms #reasoning #agent #reasoningmodels #complextasks #coding

Winbuzzer @[email protected] · 2025-12-20 · 11:54 UTC

https://winbuzzer.com/2025/12/20/openai-gpt-5-thinking-models-are-the-most-monitarable-models-to-date-xcxwbn/

OpenAI: GPT-5 Thinking Models Are The Most "Monitarable" Models To Date

#AI #OpenAI #AISafety #LLM #MachineLearning #GPT5 #DeepMind #AIResearch #ChainOfThought #Monitorability #AIAlignment #ReasoningModels

#ai #openai #aisafety #llm #machinelearning #gpt5

Winbuzzer @[email protected] · 2025-12-20 · 11:54 UTC

https://winbuzzer.com/2025/12/20/openai-gpt-5-thinking-models-are-the-most-monitarable-models-to-date-xcxwbn/

OpenAI: GPT-5 Thinking Models Are The Most "Monitarable" Models To Date

#AI #OpenAI #AISafety #LLM #MachineLearning #GPT5 #DeepMind #AIResearch #ChainOfThought #Monitorability #AIAlignment #ReasoningModels

#ai #openai #aisafety #llm #machinelearning #gpt5

Winbuzzer @[email protected] · 2025-12-20 · 11:54 UTC

https://winbuzzer.com/2025/12/20/openai-gpt-5-thinking-models-are-the-most-monitarable-models-to-date-xcxwbn/

OpenAI: GPT-5 Thinking Models Are The Most "Monitarable" Models To Date

#AI #OpenAI #AISafety #LLM #MachineLearning #GPT5 #DeepMind #AIResearch #ChainOfThought #Monitorability #AIAlignment #ReasoningModels

#ai #openai #aisafety #llm #machinelearning #gpt5

Winbuzzer @[email protected] · 2025-12-20 · 11:54 UTC

https://winbuzzer.com/2025/12/20/openai-gpt-5-thinking-models-are-the-most-monitarable-models-to-date-xcxwbn/

OpenAI: GPT-5 Thinking Models Are The Most "Monitarable" Models To Date

#AI #OpenAI #AISafety #LLM #MachineLearning #GPT5 #DeepMind #AIResearch #ChainOfThought #Monitorability #AIAlignment #ReasoningModels

#reasoningmodels #aialignment #monitorability #chainofthought #airesearch #deepmind

Winbuzzer @[email protected] · 2025-12-20 · 11:54 UTC

https://winbuzzer.com/2025/12/20/openai-gpt-5-thinking-models-are-the-most-monitarable-models-to-date-xcxwbn/

OpenAI: GPT-5 Thinking Models Are The Most "Monitarable" Models To Date

#AI #OpenAI #AISafety #LLM #MachineLearning #GPT5 #DeepMind #AIResearch #ChainOfThought #Monitorability #AIAlignment #ReasoningModels

#ai #openai #aisafety #llm #machinelearning #gpt5

Hacker News @[email protected] · 2025-10-31 · 09:40 UTC

Reasoning Models Reason Well, Until They Don't

https://arxiv.org/abs/2510.22371

#HackerNews #ReasoningModels #ReasonWell #AIResearch #MachineLearning #HackerNews

#hackernews #reasoningmodels #reasonwell #airesearch #machinelearning

Miguel Afonso Caetano @[email protected] · 2025-09-05 · 20:53 UTC

"The point is that with each advance in AI, new hurdles become apparent; when one missing aspect of “intelligence” is filled in, we find ourselves bumping up against another gap. When I speculated about GPT-5 last year, it didn’t occur to me to question whether it would know how to set priorities, because the models of the time weren’t even capable enough for that to be a limiting factor. In a post from November, AI is Racing Forward – on a Very Long Road, I wrote:

…the real challenges may be things that we can’t easily anticipate right now, weaknesses that we will only start to put our finger on when we observe [future models] performing astonishing feats and yet somehow still not being able to write that tightly-plotted novel.

In April 2024, it seemed like agentic AI was going to be the next big thing. The ensuing 16 months have brought enormous progress on many fronts, but very little progress on real-world agency. With projects like AI Village shining a light on the profound weakness of current AI agents, I think robust real-world capability is still years away."

https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent

#AI #GenerativeAI #LLMs #Chatbots #AIAgents #AgenticAI #ReasoningModels

#ai #generativeai #llms #chatbots #aiagents #agenticai

Dr. Thompson @[email protected] · 2025-09-01 · 20:52 UTC

🧠 What if you could tell AI how much to think before answering?
Seed-OSS 36B gives builders a thinking budget knob + 512K context window—control depth vs speed like never before. ⚡

👉 See how it changes product SLAs, costs, and user experience:
https://medium.com/@rogt.x1997/seed-oss-36b-a-tweakable-reasoning-engine-for-long-context-work-66aa05a72548

#AI #ReasoningModels #LongContext
https://medium.com/@rogt.x1997/seed-oss-36b-a-tweakable-reasoning-engine-for-long-context-work-66aa05a72548

#ai #reasoningmodels #longcontext

N-gated Hacker News @[email protected] · 2025-06-14 · 20:28 UTC

Seven so-called "replies" to Apple's paper on reasoning models, or as I like to call them, seven exercises in missing the point entirely. 📚🤦‍♂️ It's almost like a bad magic trick: look over here at these rebuttals while we pretend the original issue just vanishes! 🎩✨
https://garymarcus.substack.com/p/seven-replies-to-the-viral-apple #AppleReplies #ReasoningModels #MissingThePoint #BadMagicTrick #TechCritique #HackerNews #ngated

#applereplies #reasoningmodels #missingthepoint #badmagictrick #techcritique #hackernews

Winbuzzer @[email protected] · 2025-06-11 · 08:37 UTC

OpenAI Releases new o3-Pro AI Model: A High-Stakes Bet on AI Reliability

#AI #OpenAI #o3pro #LLM #TechNews #ReasoningModels #ChatGPT #EnterpriseAI #AIEthics

https://winbuzzer.com/2025/06/11/openai-releases-new-o3-pro-ai-model-a-high-stakes-bet-on-ai-reliability-xcxwbn/

#ai #openai #o3pro #llm #technews #reasoningmodels

Hacker News @[email protected] · 2025-06-08 · 11:47 UTC

The Illusion of Thinking: Strengths and Limitations of Reasoning Models

https://machinelearning.apple.com/research/illusion-of-thinking

#HackerNews #IllusionOfThinking #ReasoningModels #StrengthsAndLimitations #AIResearch #MachineLearning

#hackernews #illusionofthinking #reasoningmodels #strengthsandlimitations #airesearch #machinelearning

Matt Williams @technovangelist · 2025-06-07 · 20:47 UTC

No more guessing games! 🕵️‍♂️ #ollama's new 'think' feature cleanly separates the model's internal thinking from the content. Easy to enable - just 'think': true in your API request. #AIdevelopment #ReasoningModels https://youtu.be/yBD598s5g8c

#ollama #aidevelopment #reasoningmodels

Winbuzzer @[email protected] · 2025-05-29 · 14:23 UTC

DeepSeek R1 AI Model Update Boosts Reasoning, Catching up With OpenAI o3 and Gemini 2.5 Pro

#AI #DeepSeek #GenAI #LLM #DeepSeekR1 #AIUpdate #OpenSourceAI #ReasoningModels #AIBenchmarks #MachineLearning #ChinaAI #China

https://winbuzzer.com/2025/05/29/deepseek-r1-ai-model-update-boosts-reasoning-catching-up-with-openai-o3-and-gemini-2-5-pro-xcxwbn/

#ai #deepseek #genai #llm #deepseekr1 #aiupdate

Winbuzzer @[email protected] · 2025-05-01 · 11:45 UTC

Microsoft Debuts Phi-4 Reasoning Models, Aiming for Big Performance Gains

#Microsoft #AI #Phi4 #SLM #LLM #OpenSourceAI #ReasoningModels #GenAI #MachineLearning

https://winbuzzer.com/2025/05/01/microsoft-debuts-phi-4-reasoning-models-aiming-for-big-performance-gains-xcxwbn/

#microsoft #ai #phi4 #slm #llm #opensourceai

Winbuzzer @[email protected] · 2025-04-19 · 08:27 UTC