#minitron — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #minitron, aggregated by home.social.
-
🧠 #NVIDIA researchers are advancing SLMs through structured weight pruning and knowledge distillation, cutting down model size while preserving performance.
For instance, #Minitron 8B and 4B models, derived from Nemotron 15B, outperform many models trained from scratch. Despite its size, it competes with top-tier models like #Gemma2 and #Phi2.
-
🧠 #NVIDIA researchers are advancing SLMs through structured weight pruning and knowledge distillation, cutting down model size while preserving performance.
For instance, #Minitron 8B and 4B models, derived from Nemotron 15B, outperform many models trained from scratch. Despite its size, it competes with top-tier models like #Gemma2 and #Phi2.
-
🧠 #NVIDIA researchers are advancing SLMs through structured weight pruning and knowledge distillation, cutting down model size while preserving performance.
For instance, #Minitron 8B and 4B models, derived from Nemotron 15B, outperform many models trained from scratch. Despite its size, it competes with top-tier models like #Gemma2 and #Phi2.
-
🧠 #NVIDIA researchers are advancing SLMs through structured weight pruning and knowledge distillation, cutting down model size while preserving performance.
For instance, #Minitron 8B and 4B models, derived from Nemotron 15B, outperform many models trained from scratch. Despite its size, it competes with top-tier models like #Gemma2 and #Phi2.
-
🧠 #NVIDIA researchers are advancing SLMs through structured weight pruning and knowledge distillation, cutting down model size while preserving performance.
For instance, #Minitron 8B and 4B models, derived from Nemotron 15B, outperform many models trained from scratch. Despite its size, it competes with top-tier models like #Gemma2 and #Phi2.