#quantization — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #quantization, aggregated by home.social.

Brandon @[email protected] · 2026-05-04 · 17:44 UTC

optimization-kernels: C++ kernels and utilities for quantization and inference optimization.
👉 https://github.com/brandonhimpfen/optimization-kernels
#ai #artificialintelligence #machinelearning #llm #inference #quantization

#ai #artificialintelligence #machinelearning #llm #inference #quantization
रञ्जित (Ranjit Mathew) @[email protected] · 2026-04-29 · 13:19 UTC

An excellent introduction to #quantization used for #LLMs 👌🏽:
“Quantization From The Ground Up”, Sam Rose, Ngrok (https://ngrok.com/blog/quantization).
On HN: https://news.ycombinator.com/item?id=47519295
#AI #Math #FloatingPoint #NumericalAnalysis #Numbers #NeuralNetworks #Precision #Accuracy

#quantization #llms #ai #math #floatingpoint #numericalanalysis
रञ्जित (Ranjit Mathew) @[email protected] · 2026-04-28 · 13:14 UTC

Impressive:
“TurboQuant: Redefining AI Efficiency With Extreme Compression”, Amir Zandieh, et al, Google Research (https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression/).
The paper: https://arxiv.org/abs/2504.19874
On HN: https://news.ycombinator.com/item?id=47513475
#TurboQuant #Quantization #LLMs #Vectors #Compression #Paper

#turboquant #quantization #llms #vectors #compression #paper
N-gated Hacker News @[email protected] · 2026-04-04 · 15:27 UTC

🚀🌐 Oh great, now #Google wants us to #turbocharge our #browsers with "vector quantization" mumbo-jumbo that requires versions of Chrome, Firefox, and Safari that don't even exist yet. 🤖 Because who doesn't want to compress their vectors in 3 bits/dim while their browsers and brains crash simultaneously. 🙄
https://github.com/teamchong/turboquant-wasm #Vector #Quantization #Browser #Update #TechNews #HackerNews #ngated

#google #turbocharge #browsers #vector #quantization #browser
Australia News Beep @[email protected] · 2026-03-31 · 05:20 UTC

New AI Breakthrough May Bring Full FSD V14 to Tesla’s HW3 Vehicles
March 30, 2026 By Karan Singh For owners of Tesla vehicles equipped with HW3, the wait for the…
#NewsBeep #News #Artificialintelligence #AI #AI4 #ArtificialIntelligence #AU #Australia #FSD #HW3 #memory #neuralnetworks #Nvidia #Quantization #Technology #TESLA
https://www.newsbeep.com/au/575661/

#tesla #technology #quantization #nvidia #neuralnetworks #memory
United Kingdom News Beep @[email protected] · 2026-03-30 · 20:20 UTC

New AI Breakthrough May Bring Full FSD V14 to Tesla’s HW3 Vehicles
March 30, 2026 By Karan Singh For owners of Tesla vehicles equipped with HW3, the wait for the…
#NewsBeep #News #Artificialintelligence #AI #Ai4 #ArtificialIntelligence #FSD #HW3 #Memory #neuralnetworks #Nvidia #Quantization #Technology #Tesla #UK #UnitedKingdom
https://www.newsbeep.com/uk/503987/

#unitedkingdom #uk #tesla #technology #quantization #nvidia
Canada News Beep @[email protected] · 2026-03-30 · 18:20 UTC

New AI Breakthrough May Bring Full FSD V14 to Tesla’s HW3 Vehicles
March 30, 2026 By Karan Singh For owners of Tesla vehicles equipped with HW3, the wait for the…
#NewsBeep #News #Artificialintelligence #AI #AI4 #ArtificialIntelligence #CA #Canada #FSD #hw3 #memory #neuralnetworks #Nvidia #Quantization #Technology #TESLA
https://www.newsbeep.com/ca/571227/

#tesla #technology #quantization #nvidia #neuralnetworks #memory
United States News Beep @[email protected] · 2026-03-30 · 17:20 UTC

New AI Breakthrough May Bring Full FSD V14 to Tesla’s HW3 Vehicles
March 30, 2026 By Karan Singh For owners of Tesla vehicles equipped with HW3, the wait for the…
#NewsBeep #News #US #USA #UnitedStates #UnitedStatesOfAmerica #Artificialintelligence #AI #Ai4 #ArtificialIntelligence #FSD #HW3 #Memory #neuralnetworks #NVIDIA #Quantization #Technology #Tesla
https://www.newsbeep.com/us/554537/

#tesla #technology #quantization #nvidia #neuralnetworks #memory
UKP Lab @[email protected] · 2026-03-27 · 09:38 UTC

Authors: Federico Marcuzzi (INSAIT - Institute for Computer Science, Artificial Intelligence and Technology), Xuefei Ning (Tsinghua University), Roy Schwartz (The Hebrew University of Jerusalem), and Iryna Gurevych (UKP Lab, Technische Universität Darmstadt and ATHENE Center).
See you at #EACL2026 in Rabat 🕌!
#UKPLab #NLProc #ResponsibleAI #Quantization #MLSafety #Fairness #TrustworthyAI #ModelCompression #LLMSafety #EthicalAI #NLP #AIResearch

#eacl2026 #ukplab #nlproc #responsibleai #quantization #mlsafety
N-gated Hacker News @[email protected] · 2026-03-25 · 16:38 UTC

🎉 Wow, an article longer than the collective thoughts of its intended audience! Sam Rose seems to think we're all aspiring data scientists with infinite free time and an endless love for #quantization. 😂 6,658 words later, we're left with an 80 billion-parameter headache and absolutely zero desire to quantize anything ever again. 🚀🔢
https://ngrok.com/blog/quantization #HackerNews #DataScience #LongRead #Humor #HackerNews #ngated

#quantization #hackernews #datascience #longread #humor #ngated
Hacker News @[email protected] · 2026-03-25 · 16:38 UTC

Quantization from the Ground Up
https://ngrok.com/blog/quantization
#HackerNews #Quantization #Ground #Up #Machine #Learning #AI #Technology #Blog

#hackernews #quantization #ground #up #machine #learning
Rost Glukhov @[email protected] · 2026-02-09 · 10:13 UTC

Compare GGUF, GPTQ, and AWQ quantization formats for LLMs on consumer GPUs. Learn how to balance model quality, speed, and memory usage with Q4_K_M, IQ4_XS, and Q3_K_S variants for optimal inference performance.
#GGUF #quantization #LLM inference #GPU optimization #model deployment
https://dasroot.net/posts/2026/02/gguf-quantization-quality-speed-consumer-gpus/

#gguf #quantization #llm #gpu #model
Reddit Tech VN Bot @[email protected] · 2026-02-01 · 00:22 UTC

🧠 Why do the NVFP8/MXFP8 formats get no attention in llama.cpp or vLLM, even though they are more accurate than FP8 and optimized for the Blackwell architecture? An open question for the AI community!
#AI #MachineLearning #Quantization #ĐịnhDạng #TríTuệNhânTạo #HọcMáy
https://www.reddit.com/r/LocalLLaMA/comments/1qsi8n2/why_no_nvfp8_or_mxfp8/

#ai #machinelearning #quantization #dịnhdạng #trituệnhantạo #họcmay
Reddit Tech VN Bot @[email protected] · 2026-01-31 · 12:23 UTC

A Reddit user compared three 4-bit quantization methods (Q4_K_M, Q4_K_XL, and MXFP4) on the GLM-4.7-Flash and Nemotron-3-nano models. MXFP4 gave lower perplexity (10.72 PPL) and loaded faster than Q4_K_M (16.17 PPL). It also used 17% less VRAM and processed 5% faster than Q4_K_XL. The results suggest MXFP4 may be the best choice for large models in the 30–32B parameter range. #AI #Quantization #MôHìnhĐịnhLượng #TríTuệNhânTạo #HọcMáy
https://www.reddit.com/r/LocalLLaMA/comments/1qrzyaz/i_foun

#ai #quantization #mohinhdịnhlượng #trituệnhantạo #họcmay
Reddit Tech VN Bot @[email protected] · 2026-01-31 · 08:24 UTC

Comparing MXFP4 vs Q4_K_M/XL quantization on the GLM-4.7-Flash model:
📉 Surprising result: MXFP4 has lower perplexity (PPL, ~10.72) than Q4_K_XL (~15.73), despite a smaller file size (15.79 GiB vs 16.31 GiB).
🚀 Speed: MXFP4 processes faster and uses less VRAM.
🤔 The open question: does lower PPL also mean better tool-calling and coding ability?
#LLM #AI #Quantization #MXFP4 #MachineLearning #CongNghe #LocalLLM
https://www.reddit.com

#llm #ai #quantization #mxfp4 #machinelearning #congnghe
Reddit Tech VN Bot @[email protected] · 2026-01-30 · 08:19 UTC

A benchmark on an RTX 4070 Super (12 GB) shows Qwen 2.5 Coder 7B (AWQ Int4) running 24% faster (44.6 TPS) and using less VRAM (9.49 GB) than Qwen 2.5 3B FP16 (35.9 TPS, 10 GB). Conclusion: a larger quantized model performs better on consumer GPUs. #AI #Quantization #Benchmark #RTX4070 #LLM #TríTuệNhânTạo #địnhlượng #đánhgiá
https://www.reddit.com/r/LocalLLaMA/comments/1qqz7mi/benchmark_the_power_of_quantization_qwen_25_coder/

#ai #quantization #benchmark #rtx4070 #llm #trituệnhantạo
Reddit Tech VN Bot @[email protected] · 2026-01-29 · 15:30 UTC

I'm running the QwQ 32B model in LM Studio with 4-bit quantization; optimizing the K/V cache speeds up processing 3x (40k context instead of 10k) while bringing VRAM use down to 19GB/24GB. But does reducing the K/V cache to 4 bits hurt accuracy much? This is an effective optimization for chat/role-play with a local LLM. #AI #MáyHọc #LLM #TốiƯuHóa #Quantization #KVTuning
https://www.reddit.com/r/ollama/comments/1qqan74/effects_of_quantized_kv_cache_on_an_already/

#ai #mayhọc #llm #tốiưuhoa #quantization #kvtuning
Ai Story News @aistorynews · 2026-01-19 · 00:13 UTC

Scientific Reports precision medicine AI launches a clinic-first Collection as ML peers rethink LLM review norms and media races to keep up.
https://www.aistory.news/machine-learning/scientific-reports-precision-medicine-ai-goes-clinic-first/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Habr @[email protected] · 2025-12-21 · 16:32 UTC

How much VRAM do neural networks need?
This post will be useful to people who want to understand local models, especially those who use them as a tool for creating content, art, and design (the neural-network context here is image and video). We will also discuss choosing a graphics card and the parameters that affect generative workflows. Telegram
https://habr.com/ru/articles/979092/
#нейросеть_локально #нейросеть_для_генерации_изображений #видеокарты #quantization #comfyui #memory_bandwidth #vram #neural_networks #генеративные_модели

#генеративные_модели #neural_networks #vram #memory_bandwidth #comfyui #quantization
Ai Story News @aistorynews · 2025-12-14 · 00:08 UTC

NVIDIA deep learning courses add Earth-2, MONAI, and adversarial ML training, with free options and certificates for practitioners.
https://www.aistory.news/machine-learning/nvidia-deep-learning-courses-spotlight-practical-skills/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-12 · 00:05 UTC

NVIDIA unveils Broadened Reinforcement Learning, using massive rollout scaling to boost LLM reasoning with less compute and stable rewards.
https://www.aistory.news/machine-learning/broadened-reinforcement-learning-adds-rollout-scaling/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-11 · 00:04 UTC

NVIDIA expands its training catalog with a new Graph Neural Networks course, plus fresh modules on adversarial ML, Earth-2, and Jetson.
https://www.aistory.news/machine-learning/nvidia-adds-graph-neural-networks-course-to-lineup/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-10 · 00:04 UTC

NVIDIA unveils an interactive AI agent that accelerates ML workflows with CUDA-X and Nemotron Nano-9B-v2, plus fresh training options.
https://www.aistory.news/machine-learning/nvidia-debuts-interactive-ai-agent-to-speed-ml-tasks/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-09 · 00:07 UTC

NVIDIA expands its AI catalog with federated learning courses and modules on adversarial ML, Earth-2 weather models, and Jetson edge AI.
https://www.aistory.news/machine-learning/nvidia-adds-federated-learning-courses-to-ai-catalog/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-08 · 00:06 UTC

Accelerated ML workflows get a boost as NVIDIA details a GPU-powered agent that speeds data prep, training, and HPO by up to 43x. Today.
https://www.aistory.news/machine-learning/accelerated-ml-workflows-arrive-with-nvidias-new-agent/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-07 · 00:04 UTC

Limitless Pendant discontinued after Meta deal. Support continues for a year, features unlocked, and data export options offered to users.
https://www.aistory.news/machine-learning/limitless-pendant-discontinued-as-team-joins-meta/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-06 · 00:08 UTC

Meta Limitless acquisition signals new AI wearables, while Isaac Lab 2.3 boosts robot learning with whole‑body control and teleoperation.
https://www.aistory.news/machine-learning/meta-limitless-acquisition-signals-bigger-ai-wearables-push/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-05 · 00:08 UTC

NVIDIA's Isaac Lab Arena launches to benchmark robot policies at scale, with whole-body control, richer teleoperation data, ADR, and PBT.
https://www.aistory.news/machine-learning/isaac-lab-arena-debuts-for-scalable-robot-evaluation/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-04 · 00:05 UTC

Call Reason beta leads Android’s AI updates, while Superhuman expands Ask AI and new ML courses arrive for developers.
https://www.aistory.news/machine-learning/call-reason-beta-headlines-androids-latest-ai-upgrades/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-03 · 00:04 UTC

Android 16 AI features add notification summaries, spam checks and Expressive Captions, rolling out to Pixel devices with privacy controls.
https://www.aistory.news/machine-learning/android-16-ai-features-roll-out-to-pixel-phones-first/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-12-02 · 00:06 UTC

Ecommerce anomaly detection gains urgency as the Shopify outage underscores needs for ML monitoring, adversarial defenses, and resilient
https://www.aistory.news/machine-learning/ecommerce-anomaly-detection-rises-after-shopify-outage/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Reddit Tech VN Bot @[email protected] · 2025-11-29 · 15:19 UTC

New Qwen3-Next-80B-A3B GGUF builds are now available! They include imatrix and IQ quantization, along with MoE optimizations, delivering better performance for local LLMs.
#Qwen3Next #GGUF #LLM #AI #Quantization
#MôHìnhAI #LượngTửHóa #TríTuệNhânTạo
https://www.reddit.com/r/LocalLLaMA/comments/1p9qe7o/qwen3_next_imatrix_ggufs_up/

#qwen3next #gguf #llm #ai #quantization #mohinhai
Reddit Tech VN Bot @[email protected] · 2025-11-26 · 20:18 UTC

SGLang has just solved FP8 stability for RL training, finding that the problem lay in the quantization step. This is a major step forward for RLHF and local RL fine-tuning, and it makes mixed precision simpler to use.
#SGLang #FP8 #RLTraining #Quantization #AI #MachineLearning #HuấnLuyệnRL #TríTuệNhânTạo #HọcMáy
https://www.reddit.com/r/LocalLLaMA/comments/1p7h5ah/sglang_just_solved_fp8_stability_for_rl_training/

#sglang #fp8 #rltraining #quantization #ai #machinelearning
Ai Story News @aistorynews · 2025-11-26 · 00:04 UTC

Nemotron Nano-9B-v2 powers a GPU-accelerated AI agent that automates ML workflows and boosts key tasks by up to 43x, per NVIDIA.
https://www.aistory.news/machine-learning/nemotron-nano-9b-v2-speeds-ml-agent-tasks-up-to-43x/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-11-25 · 00:04 UTC

Claude Code updates add longer-running agents, Excel and Chrome tools, while NVIDIA debuts new RL rollout scaling and training paths.
https://www.aistory.news/machine-learning/claude-code-updates-bring-faster-smarter-ml-workflows/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-11-24 · 00:04 UTC

Valve signals PC-aligned pricing, and Steam Machine AI like DLSS, FSR, and XeSS could shape performance expectations in the living room.
https://www.aistory.news/machine-learning/steam-machine-ai-stakes-rise-as-valve-signals-pricing/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning
Ai Story News @aistorynews · 2025-11-23 · 00:04 UTC

NVIDIA’s CUDA-X Data Science shows 3x–43x ML speedups and expands training, pointing to faster, simpler workflows for teams and researchers.
https://www.aistory.news/machine-learning/cuda-x-data-science-brings-big-ml-speedups-in-new-demos/
#FederatedLearning #Quantization #ReinforcementLearning

#federatedlearning #quantization #reinforcementlearning