#quantization — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #quantization, aggregated by home.social.
-
An excellent introduction to #quantization used for #LLMs 👌🏽:
“Quantization From The Ground Up”, Sam Rose, Ngrok (https://ngrok.com/blog/quantization).
On HN: https://news.ycombinator.com/item?id=47519295
#AI #Math #FloatingPoint #NumericalAnalysis #Numbers #NeuralNetworks #Precision #Accuracy
-
An excellent introduction to #quantization used for #LLMs 👌🏽:
“Quantization From The Ground Up”, Sam Rose, Ngrok (https://ngrok.com/blog/quantization).
On HN: https://news.ycombinator.com/item?id=47519295
#AI #Math #FloatingPoint #NumericalAnalysis #Numbers #NeuralNetworks #Precision #Accuracy
-
An excellent introduction to #quantization used for #LLMs 👌🏽:
“Quantization From The Ground Up”, Sam Rose, Ngrok (https://ngrok.com/blog/quantization).
On HN: https://news.ycombinator.com/item?id=47519295
#AI #Math #FloatingPoint #NumericalAnalysis #Numbers #NeuralNetworks #Precision #Accuracy
-
An excellent introduction to #quantization used for #LLMs 👌🏽:
“Quantization From The Ground Up”, Sam Rose, Ngrok (https://ngrok.com/blog/quantization).
On HN: https://news.ycombinator.com/item?id=47519295
#AI #Math #FloatingPoint #NumericalAnalysis #Numbers #NeuralNetworks #Precision #Accuracy
-
Impressive:
“TurboQuant: Redefining AI Efficiency With Extreme Compression”, Amir Zandieh, et al, Google Research (https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression/).
The paper: https://arxiv.org/abs/2504.19874
On HN: https://news.ycombinator.com/item?id=47513475
#TurboQuant #Quantization #LLMs #Vectors #Compression #Paper
-
Authors: Federico Marcuzzi (INSAIT - Institute for Computer Science, Artificial Intelligence and Technology), Xuefei Ning (Tsinghua University), Roy Schwartz (The Hebrew University of Jerusalem), and Iryna Gurevych (UKP Lab, Technische Universität Darmstadt and ATHENE Center).
See you at #EACL2026 in Rabat 🕌!
#UKPLab #NLProc #ResponsibleAI #Quantization #MLSafety #Fairness #TrustworthyAI #ModelCompression #LLMSafety #EthicalAI #NLP #AIResearch
-
🎉 Wow, an article longer than the collective thoughts of its intended audience! Sam Rose seems to think we're all aspiring data scientists with infinite free time and an endless love for #quantization. 😂 6,658 words later, we're left with an 80 billion-parameter headache and absolutely zero desire to quantize anything ever again. 🚀🔢
https://ngrok.com/blog/quantization #HackerNews #DataScience #LongRead #Humor #HackerNews #ngated -
Quantization from the Ground Up
https://ngrok.com/blog/quantization
#HackerNews #Quantization #Ground #Up #Machine #Learning #AI #Technology #Blog
-
Сколько VRAM нужно для нейросетей?
Этот пост будет полезен людям, кто хочет разобраться в локальных моделях, особенно использующим их, как инструмент в создании контента, арта и дизайна (контекст нейросетей - image и video). Так же поговорим о выборе видеокарты и параметрах влияющих на генеративные workflow. Telegram
https://habr.com/ru/articles/979092/
#нейросеть_локально #нейросеть_для_генерации_изображений #видеокарты #quantization #comfyui #memory_bandwidth #vram #neural_networks #генеративные_модели
-
🔬🤯 Modele 1-bitowe to rewolucja w AI! Wagi sieci neuronowej zapisujemy tylko 1 bitem – zamiast 32 czy 16. To nawet 16x mniejszy rozmiar i ogromne oszczędności energii, przy zachowaniu jakości klasycznych LLM. Przyszłość AI jest lekka! 🚀#AI #LLM #quantization #BitNet
-
Google Releases Gemma 3 QAT AI Models for Consumer GPUs
#AI #AIModels #GoogleAI #Gemma3 #LLM #OpenSourceAI #GPUs #QAT #Quantization #DeepLearning #MachineLearning #NVIDIA #RTX3090 #Kaggle
https://winbuzzer.com/2025/04/20/google-releases-gemma-3-qat-ai-models-for-consumer-gpus-xcxwbn/
-
'Topological Analysis for Detecting Anomalies in dependent sequences: application to Time Series', by Frédéric Chazal, Clément Levrard, Martin Royer.
http://jmlr.org/papers/v25/24-0853.html
#topological #quantization #anomalies -
'Nonparametric Inference under B-bits Quantization', by Kexuan Li, Ruiqi Liu, Ganggang Xu, Zuofeng Shang.
http://jmlr.org/papers/v25/20-075.html
#nonparametric #spline #quantization -
'Nonparametric Inference under B-bits Quantization', by Kexuan Li, Ruiqi Liu, Ganggang Xu, Zuofeng Shang.
http://jmlr.org/papers/v25/20-075.html
#nonparametric #spline #quantization -
'Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity', by Ali Kara, Naci Saldi, Serdar Yüksel.
http://jmlr.org/papers/v24/21-1457.html
#quantization #quantized #mdps