#deepseekv3 — Public Fediverse posts on home.social

Arint - SEO+KI @[email protected] · 2026-07-18 · 04:01 UTC

RT @dunik_7: TRANSLATION: A lab at Tsinghua University has published a project on GitHub that replaces a $400,000 H100 rack with a single 24 GB graphics card. The project is called ktransformers, and the trick is almost ridiculously simple: the experts you actually use stay on the GPU, while the others wait on the CPU until they are needed. / DeepSeek-V3 and R1 with 139K context in 24 GB of VRAM / up to 28x speedup over the standard setup / fine-tuning of DeepSeek-V3 on four RTX 4090s instead of a data center / developed by the MADSys lab at Tsinghua University, not by a startup with a landing page. Apache 2.0, already more than 17,000 stars. Worth bookmarking: http://github.com/kvcache-ai/ktransformers

more at Arint.info

#AIResearch #DeepSeekV3 #ktransformers #MachineLearning #OpenSource #TsinghuaUniversity #arint_info

https://x.com/dunik_7/status/2078065378563887290#m

#airesearch #deepseekv3 #ktransformers #machinelearning #opensource #tsinghuauniversity
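
The post above describes keeping the experts in use on the GPU while the rest wait on the CPU. A minimal toy sketch of that idea follows. It is not ktransformers code: the `ExpertCache` class, the least-recently-used eviction policy, and the slot counts are all assumptions made for illustration, and no real tensors or devices are involved.

```python
from collections import OrderedDict


class ExpertCache:
    """Toy model of expert offloading (hypothetical, not the ktransformers API).

    At most `gpu_slots` experts count as resident on the GPU; every other
    expert is treated as waiting in CPU memory until it is requested.
    """

    def __init__(self, num_experts, gpu_slots):
        if not 0 < gpu_slots <= num_experts:
            raise ValueError("gpu_slots must be between 1 and num_experts")
        self.num_experts = num_experts
        self.gpu_slots = gpu_slots
        self.on_gpu = OrderedDict()  # expert id -> None, oldest use first
        self.hits = 0
        self.misses = 0

    def use(self, expert_id):
        """Record one use of an expert and return where it was found."""
        if not 0 <= expert_id < self.num_experts:
            raise IndexError("unknown expert id")
        if expert_id in self.on_gpu:
            # Already resident: mark it as most recently used.
            self.on_gpu.move_to_end(expert_id)
            self.hits += 1
            return "gpu"
        # Not resident: it has to be fetched from CPU memory.
        self.misses += 1
        if len(self.on_gpu) >= self.gpu_slots:
            # Assumed policy: evict the least recently used expert.
            self.on_gpu.popitem(last=False)
        self.on_gpu[expert_id] = None
        return "cpu"


if __name__ == "__main__":
    cache = ExpertCache(num_experts=8, gpu_slots=2)
    for expert in [0, 1, 0, 2, 1]:
        print(expert, cache.use(expert))
    print("hits:", cache.hits, "misses:", cache.misses)
```

The sketch only counts where each requested expert was found. A real system also has to move weights between devices and may choose resident experts very differently.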

36Kr Japan | 最大級の中国テック・スタートアップ専門メディア @[email protected] · 2026-04-01 · 23:00 UTC

DeepSeek suffers its longest outage ever, at 13 hours: is the next-generation model "V4" finally on its way?

https://web.brid.gy/r/https://36kr.jp/488383/

#スタートアップ #注目記事 #sns #生成ai #aiモデル #障害

deepseek @[email protected] · 2026-03-23 · 12:45 UTC

DeepSeek-V3 from Scratch: Mixture of Experts (MoE) Table of Contents DeepSeek-V3 from Scratch: Mixture of Experts (MoE) The Scaling Challenge in Neural Networks Mixture of Experts (MoE): Mathematic...

#Deep #Learning #DeepSeek #Machine #Learning #Neural #Networks #Tutorial #deepseek-v3 #expert #routing

#deep #learning #deepseek #machine #neural #networks

UNWIRE.HK @[email protected] · 2026-03-22 · 11:31 UTC

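Japan's Rakuten launches its own "AI 3.0" model, but the source code shows it uses a DeepSeek base model
Rakuten Group released its latest Japanese large language model, "Rakuten AI 3.0", on March 17. Engineers soon found that the configuration files on Hugging Face show an architecture closely matching the DeepSeek-V3 model from the Chinese AI company DeepSeek. The release was also accused of quietly removing DeepSeek-V3's original open-source license notice, which drew strong criticism from the open-source community. When asked, Rakuten declined to disclose the origin of the base model, saying only that it was "not public".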
#人工智能 #AI #DeepSeek #DeepSeek-V3
https://unwire.hk/2026/03/22/rakuten-ai-3-deepseek-v3-open-source-controversy/ai/?utm_source=rss&utm_medium=rss&utm_campaign=rakuten-ai-3-deepseek-v3-open-source-controversy

#rakuten #ai30 #deepseek #deepseekv3 #開源 #爭議

deepseek @[email protected] · 2026-03-16 · 12:45 UTC

Build DeepSeek-V3: Multi-Head Latent Attention (MLA) Architecture Table of Contents Build DeepSeek-V3: Multi-Head Latent Attention (MLA) Architecture The KV Cache Memory Problem in DeepSeek-V3 Mult...

#Deep #Learning #Large #Language #Models #PyTorch #Transformers #Tutorial #attention #mechanisms #deepseek-v3

#deep #learning #large #language #models #pytorch

deepseek @[email protected] · 2026-03-09 · 12:45 UTC

DeepSeek-V3 Model: Theory, Config, and Rotary Positional Embeddings Table of Contents DeepSeek-V3 Model: Theory, Config, and Rotary Positional Embeddings Introduction to the DeepSeek-V3 Model The F...

#DeepSeek-V3 #KV #Cache #MultiHead #Latent #Attention #RoPE #Tutorial #deepseekv3 #kv #cache

#deepseekv3 #kv #cache #multihead #latent #attention

deepseek @[email protected] · 2025-12-05 · 16:43 UTC

Beating GPT-5: DeepSeekMath-V2 Self-Corrects Logic Errors Presentational View Introduction Mathematics, with the aid of artificial intelligence, is advancing rapidly. Innovations such as informal th...

#ai-in-mathematics #deepseekmath-v2 #deepseek-v3 #open-source-ai-model #theorem-proving

#aiinmathematics #deepseekmathv2 #deepseekv3 #opensourceaimodel #theoremproving

AI Daily Post @[email protected] · 2025-12-04 · 00:02 UTC

New benchmark shows Gemini 3 Pro outpaces Gemini 2.5 in trust, ethics and safety—69% vs 16%. The study, led by Phelim Bradley and Prolific, also pits DeepSeek V3 against the models, highlighting gaps in performance and reasoning. Dive into the full analysis for the numbers and implications. #Gemini3Pro #Gemini2_5 #DeepSeekV3 #TrustAndSafety

🔗 https://aidailypost.com/news/gemini-3-pro-tops-trust-ethics-safety-69-vs-16-gemini-25

#gemini3pro #gemini2_5 #deepseekv3 #trustandsafety

AI Daily Post @[email protected] · 2025-12-03 · 18:16 UTC

New benchmarks show Mixture‑of‑Experts models on NVIDIA’s Blackwell NVL72 run up to 10× faster than on Hopper GPUs. The GB200 architecture and DeepSeek‑V3 optimizations push open‑source AI research forward. Dive into the details and see how this leap could reshape training pipelines. #MixtureOfExperts #NVIDIA #Blackwell #DeepSeekV3

🔗 https://aidailypost.com/news/mixtureofexperts-ai-models-run-10-faster-nvidia-blackwell-nvl72

#mixtureofexperts #nvidia #blackwell #deepseekv3

michabbb @[email protected] · 2025-10-01 · 00:45 UTC

🚀 Welcome GLM-4.6, the latest flagship #opensource #AI #llm with advanced agentic, reasoning & coding capabilities

⚡ Performance improvements over #GLM45 with competitive advantages against #DeepSeekV3 and #ClaudeSonnet4 across 8 public benchmarks covering agents, reasoning & coding

🧵 👇

#opensource #ai #llm #glm45 #deepseekv3 #claudesonnet4

TechCrunch | Startup and Technology News @[email protected] · 2025-09-29 · 20:56 UTC

DeepSeek: Everything you need to know about the AI chatbot app

https://web.brid.gy/r/https://techcrunch.com/2025/09/29/deepseek-everything-you-need-to-know-about-the-ai-chatbot-app/

#ai #deepseek #deepseekv3 #evergreens #explainer #generativeai

Habr @[email protected] · 2025-09-23 · 09:12 UTC

How censored and dangerous is DeepSeek?

How biased is artificial intelligence? It is customary to criticize neural networks for relaying stereotypes of human thinking picked up from their pretraining datasets. In practice, AI is far more careful than one might expect. A good example is generating photos of butterflies. As a rule, human designers are very fond of depicting butterflies dead. The reason is that entomologists follow strict visual standards: top view, wings spread at 180°, clean background, symmetry.

https://habr.com/ru/articles/949540/

#DeepSeek #DeepSeekR1 #DeepSeekV3 #КНР #Китай #большие_языковые_модели #БЯМ #искусственный_интеллект #предвзятость #цензура

#цензура #предвзятость #искусственный_интеллект #бям #большие_языковые_модели #китай

dhanrajleela @[email protected] · 2025-08-27 · 07:14 UTC

https://technologiesinternetz.blogspot.com/2025/08/deepseek-v31-vs-gpt-5-vs-claude-41.html

DeepSeek V3.1 vs GPT-5 vs Claude 4.1: Which LLM Delivers the Best Value to Users?

#deepseekv3.1 #gpt5 #claude4.1 #LLM

#deepseekv3 #gpt5 #claude4 #llm

Hacker News @[email protected] · 2025-08-21 · 19:59 UTC

DeepSeek-v3.1 Release

https://api-docs.deepseek.com/news/news250821

#HackerNews #DeepSeek #Release #DeepSeekv3.1 #TechNews #SoftwareUpdate

#hackernews #deepseek #release #deepseekv3 #technews #softwareupdate

Winbuzzer @[email protected] · 2025-04-27 · 13:01 UTC

New DeepSeek-R1T-Chimera Model Merges R1 Reasoning With Efficiency of V3-0324

#AI #LLMs #DeepSeekR1 #DeepSeekV3 #Chimera #OpenSourceAI #TNGTech #MoE #MachineLearning #TechNews #GenAI

https://winbuzzer.com/2025/04/27/new-deepseek-r1t-chimera-model-merges-r1-reasoning-with-efficiency-of-v3-0324-xcxwbn/

#llms #deepseekr1 #deepseekv3 #ai #chimera #opensourceai

eicker.TV ▹ Tech News @[email protected] · 2025-04-20 · 07:17 UTC

🧩 #Llama4Maverick uses 128 experts for significantly more computing power and even beats #GPT4o and #Gemini20 in benchmarks, with only half the active parameters of #DeepSeekv3.

🎓 Both AI models (#KIModelle) were trained with the help of the huge teacher model #Llama4 Behemoth, which, with 288 billion active parameters, is among the most powerful in the world.

👉 https://eicker.TV #Technik #Medien #Politik #Wirtschaft (2/2)

#llama4maverick #gpt4o #gemini20 #deepseekv3 #kimodelle #llama4

Pyrzout :vm: @[email protected] · 2025-04-05 · 06:10 UTC

Benchmarks Find ‘DeepSeek-V3-0324 Is More Vulnerable Than Qwen2.5-Max’ – Source: www.techrepublic.com https://ciso2ciso.com/benchmarks-find-deepseek-v3-0324-is-more-vulnerable-than-qwen2-5-max-source-www-techrepublic-com/ #threatsandvulnerabilities #rssfeedpostgeneratorecho #ArtificialIntelligence #SecurityonTechRepublic #SecurityTechRepublic #CyberSecurityNews #Cybersecurity #AIsecurity #deepseekv3 #qwen25max #AImodels #DeepSeek #Security #Alibaba #News #AI

#threatsandvulnerabilities #rssfeedpostgeneratorecho #artificialintelligence #securityontechrepublic #securitytechrepublic #cybersecuritynews

Karl Voit :emacs: :orgmode: @[email protected] · 2025-03-28 · 15:57 UTC

Study: AI (#KI) #Chatbots are useless at citing #News
https://www.derstandard.at/story/3000000261220/studie-ki-chatbots-sind-beim-zitieren-von-news-unbrauchbar

"The study examined #ChatGPT Search (#OpenAI), #Perplexity, Perplexity Pro (Perplexity AI), #Gemini 2.0 Flash (#Google), #DeepseekV3 Search (#Deepseek), #Grok-2 Search, Grok-3 Search Beta (#xAI) and #Copilot (#Microsoft and OpenAI)."

"#Grok3 [...] gave wrong answers in a full 96 percent of all cases." 🤣

#Nachrichten #Algorithmen #Automatisierung

#ki #chatbots #news #chatgpt #openai #perplexity

Hacker News @[email protected] · 2025-03-27 · 04:37 UTC

DeepSeek-V3 Technical Report

https://arxiv.org/abs/2412.19437

#HackerNews #DeepSeekV3 #TechnicalReport #AIResearch #MachineLearning #Arxiv #TechNews

#hackernews #deepseekv3 #technicalreport #airesearch #machinelearning #arxiv

B166IR @[email protected] · 2025-03-25 · 19:54 UTC

#deepseek #v3 #deepseekv3 #ai #tech #0324

#deepseek #v3 #deepseekv3 #ai #tech

Cloudbooklet @[email protected] · 2025-03-25 · 11:02 UTC

DeepSeek releases DeepSeek-V3-0324 on Hugging Face!

#DeepSeek #AI #MachineLearning #DeepSeekV3 #HuggingFace #ArtificialIntelligence #AIModel

#deepseek #ai #machinelearning #deepseekv3 #huggingface #artificialintelligence

tech news ᳇ eicker.news @[email protected] · 2025-03-25 · 10:16 UTC

»Chinese #AIlab #DeepSeek just released the latest version of their enormous #DeepSeekv3 model: The license is #MIT (that's new - previous DeepSeek v3 had a custom license).« https://simonwillison.net/2025/Mar/24/deepseek/?eicker.news #tech #media

#ailab #deepseek #deepseekv3 #mit #tech #media

Winbuzzer @[email protected] · 2025-03-24 · 16:58 UTC

DeepSeek’s new V3-0324 AI model has launched quietly, offering efficient performance on a Mac Studio

#AI #DeepSeekV3 #DeepSeek #DeepSeekV30324 #GenAI #LLM #OpenSourceAI #AIModels

https://winbuzzer.com/2025/03/24/deepseeks-new-641gb-ai-model-lands-quietly-and-runs-surprisingly-fast-on-a-mac-xcxwbn/

#ai #deepseekv3 #deepseek #deepseekv30324 #genai #llm

SoyaCincau @[email protected] · 2025-02-07 · 07:01 UTC

DeepSeek: ChatGPT killer or just another hype train? We compare it against ChatGPT and Gemini #apps #chatgpt #deepseek #deepseekr1 #deepseekv3 #digitallife #featured #gemini #geminiai #googlegemini #openai #video

https://soyacincau.com/2025/02/07/deepseek-chatgpt-killer-or-just-another-hype-train-we-compare-it-against-chatgpt-and-gemini/

#apps #chatgpt #deepseek #deepseekr1 #deepseekv3 #digitallife

KINEWS24 @[email protected] · 2025-02-04 · 14:34 UTC

DeepSeek and the story of Liang Wenfeng!

Founded in 2023
DeepSeek R1 outperforms ChatGPT
Efficient AI models
Fewer resources needed

#ai #ki #artificialintelligence #kuenstlicheintelligenz #deepseek #deepseekr1 #deepseekv3 #liangwenfeng #technologie

https://kinews24.de/deepseek-und-die-geschichte-von-lian-wenfeng/

#ai #ki #artificialintelligence #kuenstlicheintelligenz #deepseek #deepseekr1

Radargit @[email protected] · 2025-02-03 · 11:32 UTC

🚀 DeepSeek V3 vs ChatGPT-4o: Which One Reigns Supreme?🤖

AI is evolving fast! 🏎️ DeepSeek V3 and ChatGPT-4o are two of the most powerful LLMs in 2025. But which one is better?

🔍 We compare:
✅ Accuracy & performance
✅ Multimodal capabilities
✅ Speed & efficiency
✅ Real-world applications

📖 Read the full breakdown here:

https://radargit.com/2025/02/03/deepseek-v3-vs-chatgpt-4o-which-one-is-better/

Which AI model do you prefer? Comment below! 👇

#AI #DeepSeekV3 #ChatGPT4o #ArtificialIntelligence #Tech #MachineLearning #AICompari

#ai #deepseekv3 #chatgpt4o #artificialintelligence #tech #machinelearning

:fckafd: Olli Graf🚟 @[email protected] · 2025-02-01 · 14:30 UTC

This is going to take a while. #ollama #deepseekv3

#ollama #deepseekv3

Pyrzout :vm: @[email protected] · 2025-01-31 · 03:00 UTC

DeepSeek Locked Down Public Database Access That Exposed Chat History – Source: www.techrepublic.com https://ciso2ciso.com/deepseek-locked-down-public-database-access-that-exposed-chat-history-source-www-techrepublic-com/ #rssfeedpostgeneratorecho #ArtificialIntelligence #SecurityonTechRepublic #SecurityTechRepublic #CyberSecurityNews #SecurityResearch #databaseleakage #International #GenerativeAI #wizresearch #clickhouse #deepseekr1 #deepseekv3 #opensource #DeepSeek #openaio1 #Security #BigData

#rssfeedpostgeneratorecho #artificialintelligence #securityontechrepublic #securitytechrepublic #cybersecuritynews #securityresearch

Lorenzo @[email protected] · 2025-01-30 · 19:15 UTC

Research firm Wiz Research began investigating DeepSeek soon after its generative AI took the tech world by storm. #artificialintelligence #clickhouse #databaseleakage #deepseek #deepseekr1 #deepseek-v3 #generativeai #openaio1 #security #securityresearch #wizresearch
DeepSeek Locked Down Public Database Access That Exposed Chat History

#artificialintelligence #security #generativeai #securityresearch #clickhouse #deepseek

José Murilo @[email protected] · 2025-01-30 · 17:36 UTC

"A key component of the success is that it is #opensource. #DeepSeek-V3 is on GitHub with detailed docs on how it can be replicated. This has fueled a rush of people to try to make their own models." https://baixacultura.org/2025/01/29/a-corrida-da-ia-ganha-um-novo-capitulo-chines-e-open-source/

The AI race gains a new ...

#opensource #deepseekv3

Lorenzo @[email protected] · 2025-01-29 · 20:13 UTC

The Chinese firm said training the model cost just $5.6 million. Alibaba Cloud followed with a new generative AI model, while Microsoft alleges DeepSeek ‘distilled’ OpenAI’s work. #artificialintelligence #chatgpt #deepseek #deepseekr1 #deepseek-v3 #generativeai #Microsoft #nvidia #openai #reasoningmodels
DeepSeek Chatbot Beats OpenAI on App Store Leaderboard

#microsoft #artificialintelligence #nvidia #openai #generativeai #chatgpt

SoyaCincau @[email protected] · 2025-01-28 · 04:00 UTC

DeepSeek: China’s answer to ChatGPT is causing havoc, Nvidia loses nearly USD 600 bil in market cap #ai #apps #chatgpt #china #deepseek #deepseekr1 #deepseekv3 #digitallife #featured #news #tech

https://soyacincau.com/2025/01/28/deepseek-china-answer-to-chatgpt-nvidia-loses-nearly-600bil/

#ai #chatgpt #china #deepseek #deepseekr1 #deepseekv3

Child of darkness @[email protected] · 2025-01-27 · 20:10 UTC

Anyone out there who tried to run #deepseek V3 locally on a #linux machine? I'm curious if it can run with a consumer #nvidia or #amd card?

#deepseekv3

#deepseekv3 #amd #nvidia #linux #deepseek

Lety Does Stuff @[email protected] · 2025-01-27 · 09:59 UTC

Hey everyone, and welcome to Lety Does Unironically Posting About Girls Going to Glory Holes on Main Cause She's Really Upset About All the AI Misinformation Going Around Right Now

https://peertube.doesstuff.social/w/e7zTdbpHXKWjSjK9N6EXbW

#DeepSeek #DeepSeekR1 #DeepSeekV3 #OpenAI #ChatGPT #AI #LLM

#deepseek #deepseekr1 #deepseekv3 #openai #chatgpt #ai

:rss: Qiita - 人気の記事 @[email protected] · 2025-01-18 · 11:59 UTC

How a service I built as a solo developer reached 30,000 people in 2 days
https://qiita.com/nogu66/items/93468b490cd26c34cc67?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items

#qiita #TypeScript #個人開発 #Next_js #Dify #DeepSeekV3

#qiita #typescript #個人開発 #next_js #dify #deepseekv3

ThunDroid Blog @[email protected] · 2025-01-16 · 10:34 UTC

DeepSeek V3: The Model That Redefines Search Forever
#Deepseekv3
https://thundroid.co/deepseek-v3-the-model-that-redefines-search-forever/

#deepseekv3

KINEWS24 @[email protected] · 2025-01-06 · 20:12 UTC

DeepSeek V3 vs. Gemini 2.0: Who dominates?

- DeepSeek V3: 671 billion parameters!
- Gemini 2.0: multimodal & 1 million token context!
- A revolution in efficiency & versatility.

#ai #ki #deepseekv3 #gemini2 #artificialintelligence

https://kinews24.de/deepseek-v3-vs-gemini-2-0/

#ai #ki #deepseekv3 #gemini2 #artificialintelligence

Debby ‬⁂📎🐧:disability_flag: @[email protected] · 2025-01-05 · 21:05 UTC

DeepSeek-V3: A New Era in Open-Source AI with a Comedic Twist

In the ever-evolving landscape of artificial intelligence, public and open models are once again catching up with their proprietary counterparts. The recent launch of DeepSeek-V3 has stirred excitement by outperforming even Sonnet and ChatGPT in certain benchmarks. This open-source model, developed by Hangzhou DeepSeek Artificial Intelligence and Beijing DeepSeek Artificial Intelligence, boasts an impressive 671 billion parameters, making it the largest model in the open-source community. With its advanced Mixture-of-Experts (MoE) architecture and innovative technologies, DeepSeek-V3 not only outperforms its open-source counterparts like LLaMA but also rivals closed models such as Sonnet 3.5 and ChatGPT-4o.

The Power of Open Source

One of the most remarkable aspects of DeepSeek-V3 is its commitment to accessibility. By being open-source, it allows researchers and developers from around the globe to experiment, innovate, and contribute to the AI community. While it does require around 400GB of RAM to run locally, that likely won’t deter anyone from deploying it on their server. The model's documentation and training frameworks are readily available on platforms like Hugging Face, fostering collaboration and knowledge sharing. This democratization of AI technology is a significant step forward, enabling a diverse range of applications from education to programming.

Technical Innovations

DeepSeek-V3 is not just about size; it’s about performance. With a processing speed of 60 tokens per second, three times faster than its predecessor, this model is designed for efficiency. The incorporation of FP8 mixed precision training reduces GPU memory consumption without sacrificing accuracy, while the DualPipe algorithm enhances processing efficiency. These advancements not only improve performance but also keep training costs competitive, making DeepSeek-V3 a viable option for various applications.

A Comedic Twist: The Identity Crisis

However, the launch of DeepSeek-V3 has not been without its quirks. In a rather amusing turn of events, the model seems to have developed an identity crisis, often identifying itself as ChatGPT during interactions. This phenomenon has sparked laughter and intrigue within the tech community. As reported by TechCrunch, DeepSeek-V3 claimed to be a version of OpenAI’s GPT-4 model in five out of eight generations during tests. This raises questions about the model's training data and the potential for "hallucinations"—a term used to describe AI's tendency to generate inaccurate or misleading information.

While some may find this amusing, it highlights a critical issue in the AI field: the challenge of ensuring the integrity and accuracy of training data. As AI models increasingly draw from a web saturated with AI-generated content, the risk of misidentification and misinformation grows. This situation serves as a reminder of the importance of transparency and ethical practices in AI development.

Looking Ahead

Despite its comedic missteps, DeepSeek-V3 stands as a testament to the potential of open-source AI. Its impressive performance metrics and commitment to accessibility position it as a benchmark in the industry. As the DeepSeek team continues to innovate—introducing features like “Deep Roles” for customizable AI interactions—the future looks bright for this model.

In conclusion, DeepSeek-V3 not only represents a significant leap forward in open-source AI technology but also provides a lighthearted reminder of the complexities and quirks inherent in artificial intelligence. As we navigate this exciting frontier, let’s embrace both the advancements and the occasional hilarity that comes with it. Whether you’re a developer looking to explore new possibilities or simply someone who enjoys a good laugh at AI’s expense, DeepSeek-V3 is worth keeping an eye on.

Try it out here: DeepSeek-V3 https://chat.deepseek.com/sign_in

Explore more on Hugging Face: Hugging Face - DeepSeek-V3 https://huggingface.co/deepseek-ai/DeepSeek-V3

#DeepSeekV3 #OpenSourceAI #ArtificialIntelligence #AIInnovation #TechNews #MachineLearning #AICommunity #DeepLearning #AIModels #HuggingFace
#DeepSeek #DeepSeekV3

#deepseekv3 #opensourceai #artificialintelligence #aiinnovation #technews #machinelearning

rexi @[email protected] · 2025-01-01 · 22:55 UTC

https://analyticsindiamag.com/ai-news-updates/deepseek-v3-is-the-best-open-source-ai-model/

A Chinese AI research lab backed by High-Flyer Capital Management has released #DeepSeekV3, an open-source Mixture-of-Experts model with 671B total parameters, of which 37B are activated for each token. The model was trained on 14.8T tokens. DeepSeek has released the model on GitHub along with a detailed technical paper outlining its capabilities.

DeepSeek AI also released the benchmark scores, and it outperformed Meta’s flagship Llama 3.1 405B parameter model…

#deepseekv3
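
The parameter counts quoted in the post above (671B total, 37B activated per token) can be turned into a quick back-of-the-envelope calculation. The sketch below is only that: the function names are hypothetical, and the bytes-per-parameter values (1 byte for 8-bit weights, 0.5 for 4-bit) are assumptions. It counts raw weights only and ignores KV cache, activations, and runtime overhead.

```python
# Figures taken from the post: total and per-token active parameters.
TOTAL_PARAMS = 671e9
ACTIVE_PARAMS = 37e9


def active_fraction(total_params, active_params):
    """Share of the model's parameters used for a single token."""
    return active_params / total_params


def weight_size_gb(params, bytes_per_param):
    """Raw weight storage in gigabytes (10**9 bytes), weights only."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    share = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    print(f"active share per token: {share:.1%}")
    # Assumed storage widths: 8-bit = 1 byte, 4-bit = 0.5 byte per parameter.
    for label, width in [("8-bit", 1.0), ("4-bit", 0.5)]:
        print(label, weight_size_gb(TOTAL_PARAMS, width), "GB")
```

Under these assumptions, only about one parameter in eighteen takes part in any single token, which is why a Mixture-of-Experts model of this size costs far less compute per token than its total parameter count suggests.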