#distillation — Public Fediverse posts on home.social

GLOBAL Visibility aéPiot - by aePiot.ro @[email protected] · 2026-05-25 · 19:03 UTC

#CHARLES #FRANÇOIS #PROSPER DE #HEMRICOURT DE #GRÜNNE aepiot.com/advanced-sea... LA #LISEUSE aepiot.ro?lang=en&q=LA... #LOW #TEMPERATURE #DISTILLATION multi-search-tag-explorer.headlines-world.com/advanced-sea... #LIST OF #BRAZILIAN #MUSICIANS aepiot.com?lang=en&q=LI... allgraph.ro

MultiSearch Tag Explorer

#charles #francois #prosper #hemricourt #grunne #liseuse

Hacker News @[email protected] · 2026-05-12 · 19:28 UTC

Needle: We Distilled Gemini Tool Calling into a 26M Model

https://github.com/cactus-compute/needle

#HackerNews #Needle #Gemini #Tool #Model #AI #Distillation #Cactus #Compute

#hackernews #needle #gemini #tool #model #ai

Habr @[email protected] · 2026-05-06 · 23:02 UTC

Breaking down «Qwen3.5-21B-Claude-4.6-Opus-Heretic-Uncensored»: what is actually inside a fine-tune with a flashy name

A post went viral on Telegram: supposedly someone “fine-tuned Qwen 3.5 to the level of Claude 4.6 Opus and removed the censorship with Heretic”. I opened the model card on HuggingFace and spent an evening figuring out what is under the hood. Spoiler: there is a lot of interesting technique in there, but this model has about as much to do with Claude as “Adibas” sneakers have to do with Adidas. I break down distillation, depth upscaling, and abliteration without the marketing wrapper.

https://habr.com/ru/articles/1032324/

#LLM #Qwen #abliteration #файнтюн #HuggingFace #distillation #intepretability #openweights

#llm #qwen #abliteration #файнтюн #huggingface #distillation

Some Bits: Nelson's Linkblog @[email protected] · 2026-04-30 · 21:14 UTC

Oil Refineries: Details on how oil processing works
https://www.construction-physics.com/p/how-an-oil-refinery-works
#distillation #cracking #refinery #energy #texas #oil #+

#distillation #cracking #refinery #energy #texas #oil

deepseek @[email protected] · 2026-04-28 · 01:30 UTC

Distillation Diplomacy: State Department’s Cable Names Chinese AI Firms in US IP Theft Escalation U.S. State Department cable targets DeepSeek, Moonshot AI, and MiniMax for distilling American AI...

#AISecurityPro #AI #Distillation #Anthropic #Claude #China #AI #Deepseek #IP #theft #MiniMax

Origin | Interest | Match

#aisecuritypro #ai #distillation #anthropic #claude #china

tech news ᳇ eicker.news @[email protected] · 2026-04-24 · 04:15 UTC

The #US is preparing to crack down on #China’s alleged “industrial-scale theft” of #AI #intellectualproperty through #distillation attacks. The US government is exploring measures to hold foreign actors accountable, potentially including prosecuting bad actors and imposing penalties. China has denied the allegations, calling them “pure slander.” https://arstechnica.com/tech-policy/2026/04/us-accuses-china-of-industrial-scale-ai-theft-china-says-its-slander/?eicker.news #tech #media #news

#us #china #ai #intellectualproperty #distillation #tech

deepseek @[email protected] · 2026-04-24 · 02:07 UTC

Anthropic accuses Chinese labs of illicit AI model distillation using 24,000 fake accounts Anthropic identified industrial-scale distillation campaigns by three Chinese AI labs—DeepSeek, Moonshot...

#Technology #AI #distillation #Anthropic #bioweapon #development #risk #DeepSeek #MiniMax #Moonshot #White

Origin | Interest | Match

#technology #ai #distillation #anthropic #bioweapon #development

deepseek @[email protected] · 2026-04-23 · 21:45 UTC

US accuses China of “industrial-scale” AI theft. China says it’s “slander.” Trump-Xi summit may be rocked by US mulling huge sanctions. The US is preparing to crack down on China's al...

#AI #Policy #ai #theft #Anthropic #china #Distillation #Donald #Trump #google #intellectual

Origin | Interest | Match

#ai #policy #theft #anthropic #china #distillation

Afrique @[email protected] · 2026-04-21 · 05:37 UTC

https://www.europesays.com/afrique/81466/ Brussels answers for the €15 million granted to South African vineyards, but not for the anger of French winegrowers #AfriqueDuSud #Arrachage #Distillation #Europe #EuropeAfrique #JeanMarieFabre #PACCommissionEuropéenne #UE #UEAfrique #UnionEuropéenne #UnionEuropéenneAfrique #VigneronsIndépendants

#vigneronsindependants #unioneuropeenneafrique #unioneuropeenne #ueafrique #ue #paccommissioneuropeenne

Quasit @[email protected] · 2026-04-08 · 03:21 UTC

It seems to me that we should be doing things like creating solar powered glass distillers that use built-in magnifying glasses to vaporize and distill water. It shouldn't be that hard to do. Make them out of heavy glass, and give everyone in the world permission to make them. The way things are going, we are going to need cheap and easy ways to purify water that don't rely on electricity.

I can •almost• visualize the design.

#water #distillation #Technology #Tech #solar #glass

#water #distillation #technology #tech #solar #glass

Gehtso @[email protected] · 2026-04-02 · 15:40 UTC

Ukraine’s drones return to Bashneft-Novoil refinery in Ufa 1,300+ km from the front — the primary distillation unit is burning again

Without the AVT unit, which performs the first stage of crude processing, the rest of the refinery cannot operate.

https://euromaidanpress.com/2026/04/02/ukraines-drones-return-to-bashneft-novoil-refinery-in-ufa-1300-km-from-the-front-the-primary-distillation-unit-is-burning-again/

#WarOfAggression #Europa #Ukraine #Ufa #refinery #BashneftNovoil #distillation #Bashkortostan #oil #warfare #army #war #Russia #WarCriminal #invaders #occupiers
#перемогаYкраїни

#warofaggression #europa #ukraine #ufa #refinery #bashneftnovoil

Bordeaux @[email protected] · 2026-03-18 · 10:30 UTC

it’s making a mockery of everyone »

Outcry in the Bordeaux vineyards, where growers are demanding an urgent increase in distillation prices…
#Bordeaux #FR #France #Actu #News #Europe #EU #actu #Actualités #Arrachage #Distillation #europe #NouvelleAquitaine #Républiquefrançaise
https://www.europesays.com/fr/807210/

#republiquefrancaise #nouvelleaquitaine #distillation #arrachage #actualites #eu

Hacker News @[email protected] · 2026-03-15 · 03:22 UTC

Mathematics Distillation Challenge – Equational Theories

https://terrytao.wordpress.com/2026/03/13/mathematics-distillation-challenge-equational-theories/

#HackerNews #Mathematics #Distillation #Challenge #Equational #Theories #TerryTao #MathChallenge

#hackernews #mathematics #distillation #challenge #equational #theories

deepseek @[email protected] · 2026-02-28 · 19:59 UTC

3 Steps to Distill LLMs: Shrink Your Model and Save Money Chinese AI labs like DeepSeek and Moonshot didn’t invent distillation, but they showed the world what it can do. They built models that...

#llm #llmops #mlops #distillation #machine-learning

Origin | Interest | Match

#llm #llmops #mlops #distillation #machinelearning

deepseek @[email protected] · 2026-02-26 · 00:51 UTC

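16 million covert calls exposed! How did Deepseek and Minimax secretly distill Anthropic? Are you wondering what the real level of Chinese-made AI actually is? Anthropic recently accused Deepseek, Kimi, and Minimax of improperly “freeloading” on...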

#AIGC #AI抄袭争议 #Anthropic #Claude #Minimax大模型 #Model #Distillation #大模型刷真题 #大模型蒸馏 #应对DeepSeek的连招 #账号混淆调用API

Origin | Interest | Match

#aigc #ai抄袭争议 #anthropic #claude #minimax大模型 #model

PKs Powerfromspace1 @[email protected] · 2026-02-25 · 20:58 UTC

@MatthewBerman

They Got Caught...

#deepseek #anthropic #distillation #ai

https://youtu.be/VmEa3fVvZDw?si=CB6fC7BTpwp3fN9Y

( ed: skiing is a bit strong they were being cheeky)

#deepseek #anthropic #distillation #ai

Mac4Ever @[email protected] · 2026-02-24 · 11:22 UTC

Anthropic accuses DeepSeek, Moonshot, and MiniMax of copying its Claude AI
https://mac4ever.com/194847
#Mac4Ever #Anthropic #Claude #DeepSeek #Distillation

#mac4ever #anthropic #claude #deepseek #distillation

United States @[email protected] · 2026-02-24 · 03:15 UTC

Anthropic says Chinese companies misused Claude AI; Elon Musk lashes out

Elon Musk on Monday lashed out at Anthropic after the Dario Amodei-led company accused Chinese AI companies of…
#UnitedStates #US #USA #AILabs #anthropicdatastealin #anthropicstealingdata #anthrpoicai #Claude #ClaudeAImodel #claudecod #datatheft #distillation #ElonMusk #elonmuskonanthropic #industrial-scaledistillationattacks #Musk
https://www.europesays.com/2801482/

#musk #industrial #elonmuskonanthropic #elonmusk #distillation #datatheft

deepseek @[email protected] · 2026-02-23 · 22:31 UTC

Anthropic Rallies Industry to Combat AI Model Theft Anthropic said Monday (Feb. 23) that the Chinese artificial intelligence labs DeepSeek, MiniMax and Moonshot AI have illicitly used the outputs o...

#artificial #intelligence #AI #AI #model #theft #Anthropic #DeepSeek #distillation #News #PYMNTS

Origin | Interest | Match

#artificial #intelligence #ai #model #theft #anthropic

deepseek @[email protected] · 2026-02-23 · 19:57 UTC

Anthropic accuses Chinese AI labs of mining Claude as US debates AI chip exports Anthropic accuses DeepSeek, Moonshot, and MiniMax of using 24,000 fake accounts to distill Claude’s AI capabilitie...

#AI #Government #& #Policy #Anthropic #deepseek #distillation #Exclusive #minimax #moonshot #ai

Origin | Interest | Match

#ai #government #policy #anthropic #deepseek #distillation

Hacker News @[email protected] · 2026-02-23 · 19:13 UTC

Anthropic announces proof of distillation at scale by MiniMax, DeepSeek, Moonshot

https://twitter.com/anthropicai/status/2025997928242811253

#HackerNews #Anthropic #MiniMax #DeepSeek #Moonshot #AI #distillation

#hackernews #anthropic #minimax #deepseek #moonshot #ai

1337 $#!+ I did that @[email protected] · 2026-02-16 · 02:18 UTC

After scraping all that #copyright, #bigai deserves this #karma. And We The People get all the open weight models. Hey, publishers are not your friends either, remember the #mpaa trying to send Moms to prison? #distillation is all kinds of comeuppance. #AI #LLM It all is leaking into the #publicdomain !!!

https://www.theregister.com/2026/02/14/ai_risk_distillation_attacks/

#copyright #bigai #karma #mpaa #distillation #ai

Habr @[email protected] · 2026-02-01 · 18:52 UTC

QAD from NVIDIA: figuring out why 4-bit quantization stopped breaking everything

NVIDIA has released a report on the QAD method, which makes it possible to quantize LLMs to 4 bits without quality loss on hard tasks (math, code). We look at why the usual QAT “breaks” models after RLHF, how distillation via KL divergence solves this problem, and why the method works even on random data. Plus personal experience of trying to fit a 49B model onto hardware, and an analysis of the new approach.

https://habr.com/ru/articles/991586/

#LLM #Квантизация #NVIDIA #QAD #QAT #FP4 #Blackwell #Machine_Learning #Llama #Distillation

#distillation #llama #machine_learning #blackwell #fp4 #qat
