#deepseek-v3 — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #deepseek-v3, aggregated by home.social.
-
DeepSeek、史上最長の13時間障害——次世代モデル「V4」、いよいよ来るのか
-
DeepSeek-V3 from Scratch: Mixture of Experts (MoE) Table of Contents DeepSeek-V3 from Scratch: Mixture of Experts (MoE) The Scaling Challenge in Neural Networks Mixture of Experts (MoE): Mathematic...
#Deep #Learning #DeepSeek #Machine #Learning #Neural #Networks #Tutorial #deepseek-v3 #expert #routing
Origin | Interest | Match -
DeepSeek-V3 from Scratch: Mixture of Experts (MoE) Table of Contents DeepSeek-V3 from Scratch: Mixture of Experts (MoE) The Scaling Challenge in Neural Networks Mixture of Experts (MoE): Mathematic...
#Deep #Learning #DeepSeek #Machine #Learning #Neural #Networks #Tutorial #deepseek-v3 #expert #routing
Origin | Interest | Match -
Build DeepSeek-V3: Multi-Head Latent Attention (MLA) Architecture Table of Contents Build DeepSeek-V3: Multi-Head Latent Attention (MLA) Architecture The KV Cache Memory Problem in DeepSeek-V3 Mult...
#Deep #Learning #Large #Language #Models #PyTorch #Transformers #Tutorial #attention #mechanisms #deepseek-v3
Origin | Interest | Match -
DeepSeek-V3 Model: Theory, Config, and Rotary Positional Embeddings Table of Contents DeepSeek-V3 Model: Theory, Config, and Rotary Positional Embeddings Introduction to the DeepSeek-V3 Model The F...
#DeepSeek-V3 #KV #Cache #MultiHead #Latent #Attention #RoPE #Tutorial #deepseekv3 #kv #cache
Origin | Interest | Match -
Beating GPT-5: DeepSeekMath-V2 Self-Corrects Logic Errors Presentational View Introduction Mathematics with the aid of artificial intelligence, is advancing rapidly. Innovations such as informal th...
#ai-in-mathematics #deepseekmath-v2 #deepseek-v3 #open-source-ai-model #theorem-proving
Origin | Interest | Match -
🚀 Welcome GLM-4.6 the Latest flagship #opensource #AI #llm with advanced agentic, reasoning & coding capabilities
⚡ Performance improvements over #GLM45 with competitive advantages against #DeepSeekV3 and #ClaudeSonnet4 across 8 public benchmarks covering agents, reasoning & coding
🧵 👇
-
DeepSeek: Everything you need to know about the AI chatbot app
-
Насколько зацензурен и опасен DeepSeek?
Насколько предвзят искусственный интеллект? Принято ругать нейросети за трансляцию стереотипов человеческого мышления, которые были подсмотрены в датасетах предобучения. На деле ИИ куда более аккуратен, чем можно ожидать. Хороший пример — генерация фотографий бабочек. Как правило, дизайнеры-люди очень любят изображать бабочек в мёртвом виде. Дело в том, что энтомологи руководствуются строгими визуальными стандартами: вид сверху, расправленные на 180° крылья, чистый фон, симметрия.
https://habr.com/ru/articles/949540/
#DeepSeek #DeepSeekR1 #DeepSeekV3 #КНР #Китай #большие_языковые_модели #БЯМ #искусственный_интеллект #предвзятость #цензура
-
https://technologiesinternetz.blogspot.com/2025/08/deepseek-v31-vs-gpt-5-vs-claude-41.html
DeepSeek V3.1 vs GPT-5 vs Claude 4.1: Which LLM Delivers the Best Value to Users?
#deepseekv3.1 #gpt5 #claude4.1 #LLM
-
New DeepSeek-R1T-Chimera Model Merges R1 Reasoning With Efficiency of V3-0324
#AI #LLMs #DeepSeekR1 #DeepSeekV3 #Chimera #OpenSourceAI #TNGTech #MoE #MachineLearning #TechNews #GenAI
-
🧩 #Llama4Maverick nutzt 128 Experten für deutlich mehr Rechenleistung und schlägt sogar #GPT4o und #Gemini20 in Benchmarks – bei nur der Hälfte der aktiven Parameter von #DeepSeekv3.
🎓 Beide #KIModelle wurden mithilfe des riesigen Lehrmodells #Llama4 Behemoth trainiert, das mit 288 Milliarden aktiven Parametern zu den leistungsstärksten weltweit zählt.
👉 https://eicker.TV #Technik #Medien #Politik #Wirtschaft (2/2)
-
Benchmarks Find ‘DeepSeek-V3-0324 Is More Vulnerable Than Qwen2.5-Max’ – Source: www.techrepublic.com https://ciso2ciso.com/benchmarks-find-deepseek-v3-0324-is-more-vulnerable-than-qwen2-5-max-source-www-techrepublic-com/ #threatsandvulnerabilities #rssfeedpostgeneratorecho #ArtificialIntelligence #SecurityonTechRepublic #SecurityTechRepublic #CyberSecurityNews #Cybersecurity #AIsecurity #deepseekv3 #qwen25max #AImodels #DeepSeek #Security #Alibaba #News #AI
-
Studie: #KI #Chatbots sind beim Zitieren von #News unbrauchbar
https://www.derstandard.at/story/3000000261220/studie-ki-chatbots-sind-beim-zitieren-von-news-unbrauchbar"Untersucht wurden #ChatGPT Search (#OpenAI), #Perplexity, Perplexity Pro (Perplexity AI), #Gemini 2.0 Flash (#Google), #DeepseekV3 Search (#Deepseek), #Grok-2 Search, Grok-3 Search Beta (#xAI) sowie #Copilot (#Microsoft und OpenAI)."
"#Grok3 [...] lieferte gleich in 96 Prozent aller Fälle falsche Antworten." 🤣
-
DeepSeek releases DeepSeek-V3-0324 on Hugging Face!
#DeepSeek #AI #MachineLearning #DeepSeekV3 #HuggingFace #ArtificialIntelligence #AIModel
-
»Chinese #AIlab #DeepSeek just released the latest version of their enormous #DeepSeekv3 model: The license is #MIT (that's new - previous DeepSeek v3 had a custom license).« https://simonwillison.net/2025/Mar/24/deepseek/?eicker.news #tech #media
-
DeepSeek’s new V3-0324 AI model has launched quietly, offering efficient performance on a Mac Studio
#AI #DeepSeekV3 #DeepSeek #DeepSeekV30324 #GenAI #LLM #OpenSourceAI #AIModels
-
DeepSeek: ChatGPT killer or just another hype train? We compare it against ChatGPT and Gemini #apps #chatgpt #deepseek #deepseekr1 #deepseekv3 #digitallife #featured #gemini #geminiai #googlegemini #openai #video
-
DeepSeek und die Geschichte von Liang Wenfeng!
Gründung 2023
DeepSeek R1 übertrifft ChatGPT
Effiziente KI-Modelle
Weniger Ressourcen nötig#ai #ki #artificialintelligence #kuenstlicheintelligenz #deepseek #deepseekr1 #deepseekv3 #liangwenfeng #technologie
https://kinews24.de/deepseek-und-die-geschichte-von-lian-wenfeng/
-
🚀 DeepSeek V3 vs ChatGPT-4o: Which One Reigns Supreme?🤖
AI is evolving fast! 🏎️ DeepSeek V3 and ChatGPT-4o are two of the most powerful LLMs in 2025. But which one is better?
🔍 We compare:
✅ Accuracy & performance
✅ Multimodal capabilities
✅ Speed & efficiency
✅ Real-world applications📖 Read the full breakdown here:
https://radargit.com/2025/02/03/deepseek-v3-vs-chatgpt-4o-which-one-is-better/
Which AI model do you prefer? Comment below! 👇
#AI #DeepSeekV3 #ChatGPT4o #ArtificialIntelligence #Tech #MachineLearning #AICompari
-
Das wird noch etwas dauern. #ollama #deepseekv3
-
DeepSeek Locked Down Public Database Access That Exposed Chat History – Source: www.techrepublic.com https://ciso2ciso.com/deepseek-locked-down-public-database-access-that-exposed-chat-history-source-www-techrepublic-com/ #rssfeedpostgeneratorecho #ArtificialIntelligence #SecurityonTechRepublic #SecurityTechRepublic #CyberSecurityNews #SecurityResearch #databaseleakage #International #GenerativeAI #wizresearch #clickhouse #deepseekr1 #deepseekv3 #opensource #DeepSeek #openaio1 #Security #BigData
-
Research Firm Wiz Research began investigating DeepSeek soon after its generative AI took the tech world by storm.#artificialintelligence #clickhouse #databaseleakage #deepseek #deepseekr1 #deepseek-v3 #generativeai #openaio1 #security #securityresearch #wizresearch
DeepSeek Locked Down Public Database Access That Exposed Chat History -
"A key component of the success is that it is #opensource. #DeepSeek-V3 is on GitHub with detailed docs on how it can be replicated. This has fueled a rush of people to try to make their own models." https://baixacultura.org/2025/01/29/a-corrida-da-ia-ganha-um-novo-capitulo-chines-e-open-source/
A corrida da IA ganha um novo ... -
The Chinese firm said training the model cost just $5.6 million. Alibaba Cloud followed with a new generative AI model, while Microsoft alleges DeepSeek ‘distilled’ OpenAI’s work.#artificialintelligence #chatgpt #deepseek #deepseekr1 #deepseek-v3 #generativeai #Microsoft #nvidia #openai #reasoningmodels
DeepSeek Chatbot Beats OpenAI on App Store Leaderboard -
DeepSeek: China’s answer to ChatGPT is causing havoc, Nvidia loses nearly USD 600 bil in market cap #ai #apps #chatgpt #china #deepseek #deepseekr1 #deepseekv3 #digitallife #featured #news #tech
https://soyacincau.com/2025/01/28/deepseek-china-answer-to-chatgpt-nvidia-loses-nearly-600bil/