#sparseattention — Public Fediverse posts on home.social

deepseek @[email protected] · 2026-02-25 · 00:02 UTC

Understand DeepSeek V3.2: Pushing the Frontier of Open LLMs Recently, I joined the MLSys 2026 NVIDIA competition track! So I’m trying to understand DeepSeek V3.2, sparse attention, and learn GPU...

#gpu #sparse-attention #llm #machine-learning #deepseek

Origin | Interest | Match

#gpu #sparseattention #llm #machinelearning #deepseek

AIagent.at 🤖 AI News @[email protected] · 2026-02-12 · 04:08 UTC

#ZAI: #GLM5, a new large language model, is designed for #complexsystemsengineering and long-horizon agentic tasks. It boasts 744 billion parameters and integrates #DeepSeek #SparseAttention for improved efficiency. GLM-5 outperforms previous models on various benchmarks, including #reasoning, #coding, and #agentictasks, and is open-sourced for wider accessibility. https://z.ai/blog/glm-5?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI

#zai #glm5 #complexsystemsengineering #deepseek #sparseattention #reasoning

AIagent.at 🤖 AI News @[email protected] · 2026-02-12 · 04:08 UTC

#ZAI: #GLM5, a new large language model, is designed for #complexsystemsengineering and long-horizon agentic tasks. It boasts 744 billion parameters and integrates #DeepSeek #SparseAttention for improved efficiency. GLM-5 outperforms previous models on various benchmarks, including #reasoning, #coding, and #agentictasks, and is open-sourced for wider accessibility. https://z.ai/blog/glm-5?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI

#zai #glm5 #complexsystemsengineering #deepseek #sparseattention #reasoning

AIagent.at 🤖 AI News @[email protected] · 2026-02-12 · 04:08 UTC

#ZAI: #GLM5, a new large language model, is designed for #complexsystemsengineering and long-horizon agentic tasks. It boasts 744 billion parameters and integrates #DeepSeek #SparseAttention for improved efficiency. GLM-5 outperforms previous models on various benchmarks, including #reasoning, #coding, and #agentictasks, and is open-sourced for wider accessibility. https://z.ai/blog/glm-5?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI

#zai #glm5 #complexsystemsengineering #deepseek #sparseattention #reasoning

AIagent.at 🤖 AI News @[email protected] · 2026-02-12 · 04:08 UTC

#ZAI: #GLM5, a new large language model, is designed for #complexsystemsengineering and long-horizon agentic tasks. It boasts 744 billion parameters and integrates #DeepSeek #SparseAttention for improved efficiency. GLM-5 outperforms previous models on various benchmarks, including #reasoning, #coding, and #agentictasks, and is open-sourced for wider accessibility. https://z.ai/blog/glm-5?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI

#genai #llm #nlp #ml #ai #aiagent

AIagent.at 🤖 AI News @[email protected] · 2026-02-12 · 04:08 UTC

#ZAI: #GLM5, a new large language model, is designed for #complexsystemsengineering and long-horizon agentic tasks. It boasts 744 billion parameters and integrates #DeepSeek #SparseAttention for improved efficiency. GLM-5 outperforms previous models on various benchmarks, including #reasoning, #coding, and #agentictasks, and is open-sourced for wider accessibility. https://z.ai/blog/glm-5?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI