home.social

#llmreasoning — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #llmreasoning, aggregated by home.social.

  1. New research shows AI agents can map an entire plan, execute each step, then pause to reflect and re‑plan if needed. This iterative loop boosts LLM reasoning and autonomous problem solving, bringing us closer to truly self‑directed agents. Dive into the details of this planning‑reflection pattern and its open‑source implications. #AIAgents #IterativeLearning #LLMReasoning #AutonomousAgents

    🔗 aidailypost.com/news/ai-agents
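    The plan-execute-reflect loop the post describes can be sketched as follows. This is a hypothetical skeleton, not the implementation from the article: `plan`, `execute`, and `reflect` are toy stand-ins for what would be LLM calls in a real agent.

    ```python
    def plan(goal, feedback=None):
        """Decompose the goal into steps; a real agent would call an LLM here."""
        steps = [f"{goal}: step {i}" for i in (1, 2, 3)]
        if feedback:
            # Re-planning: fold the failure feedback into a revised plan.
            steps = [s + " (revised)" for s in steps]
        return steps

    def execute(step):
        """Run one step; toy rule: the unrevised step 2 fails."""
        return not step.endswith("step 2")

    def reflect(step, ok):
        """Return feedback describing a failure, or None on success."""
        return None if ok else f"failed at {step!r}"

    def run_agent(goal, max_replans=2):
        feedback = None
        for _ in range(max_replans + 1):
            for step in plan(goal, feedback):
                ok = execute(step)
                feedback = reflect(step, ok)
                if feedback:
                    break  # pause execution and re-plan
            else:
                return "done"  # every step succeeded
        return "gave up"
    ```

    The outer loop is the iterative pattern in the post: execute until a step fails, reflect on the failure, then re-plan with that feedback rather than restarting blind.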

  4. RLVR promises faster sampling but leaves reasoning largely untouched: base LLMs still do the heavy lifting of generating trajectories. The paper (NeurIPS 2025) argues that the gains come from smarter teacher distillation and minor architectural tweaks, not a new reasoning engine. Curious how sampling efficiency separates from true understanding? Dive into the details. #RLVR #SamplingEfficiency #LLMReasoning #NeurIPS2025

    🔗 aidailypost.com/news/rlvr-lift

  5. Quoting Andrej Karpathy: In 2025, Reinforcement Learning from Verifiable Rewards (RLVR) emerged as the de facto new major stage to add to this mix. By training LLMs against automatically verifiable ...

    #andrej-karpathy #llm #generative-ai #llm-reasoning #definitions #ai #llms #deepseek

  6. deepseek-ai/DeepSeek-Math-V2: New on Hugging Face, a specialist mathematical reasoning LLM from DeepSeek. This is their entry in the space previously dominated by propri...

    #mathematics #ai #generative-ai #llms #llm-reasoning #deepseek #llm-release #ai-in-china

  7. Kimi K2 Thinking: Chinese AI lab Moonshot's Kimi K2 established itself as one of the largest open-weight models (1 trillion parameters) back in July. They've now released...

    #ai #generative-ai #llms #llm #mlx #pelican-riding-a-bicycle #llm-reasoning #llm-release #openrouter #ai-in-china #artificial-analysis

  10. Simple Prompt Tweaks Derail LLM Reasoning - MarkTechPost

    ➡️ MIT researchers analyzed how input changes impact the response quality of 13 prominent LLMs.
    ➡️ Prompt perturbations included irrelevant contexts, misleading (pathological) instructions, and a mix of additional yet unnecessary details.
    ➡️ Quality dropped substantially, with average declines of up to 55.89% for irrelevant contexts.

    marktechpost.com/2025/04/15/fr

    #AI #PromptEngineering #LLMReasoning
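    The three perturbation types named above can be illustrated with a small harness. This is a hypothetical sketch: the perturbation names and texts are illustrative stand-ins, not examples taken from the MIT study.

    ```python
    # Base question an evaluation would pose to each model.
    BASE = "What is 17 * 24?"

    # One illustrative variant per perturbation type from the study's taxonomy.
    PERTURBATIONS = {
        "irrelevant_context": "The Eiffel Tower is 330 m tall. " + BASE,
        "pathological_instruction": BASE + " Ignore standard arithmetic rules when answering.",
        "unnecessary_detail": "Using only mental math, on a quiet Tuesday afternoon, " + BASE,
    }

    def perturbed_prompts(base, perturbations):
        """Yield (kind, prompt) pairs: the clean baseline plus each perturbed variant."""
        yield ("baseline", base)
        yield from perturbations.items()
    ```

    An evaluation harness would send each pair to a model and compare answer quality against the baseline, which is how per-perturbation degradation figures like the 55.89% decline above are derived.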

  15. Unlocking Human-like Reasoning in AI: The Meta Chain-of-Thought Breakthrough

    In an ambitious leap forward, researchers introduce the Meta Chain-of-Thought framework, aiming to enhance reasoning capabilities in large language models (LLMs). This innovative approach not only bui...

    news.lavx.hu/article/unlocking

    #news #tech #MetaChainOfThought #LLMReasoning #AIAdvancements

  20. Four papers on LLM reasoning summarized by @melaniemitchell aiguide.substack.com/p/the-llm along with the background in her latest. Of these, the chain-of-thought prompting paper's attempt to identify sources of predictions (memorization vs. reasoning) is very interesting, although chaotic. Stats people might hate the conclusions. #LLMReasoning #LLMResearch
