home.social

#minitron — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #minitron, aggregated by home.social.

  1. 🧠 #NVIDIA researchers are advancing SLMs through structured weight pruning and knowledge distillation, cutting down model size while preserving performance.

    For instance, #Minitron 8B and 4B models, derived from Nemotron 15B, outperform many models trained from scratch. Despite its size, it competes with top-tier models like #Gemma2 and #Phi2.

    src: developer.nvidia.com/blog/how-

    #AI #MachineLearning #NLP #LLM

  2. 🧠 #NVIDIA researchers are advancing SLMs through structured weight pruning and knowledge distillation, cutting down model size while preserving performance.

    For instance, #Minitron 8B and 4B models, derived from Nemotron 15B, outperform many models trained from scratch. Despite its size, it competes with top-tier models like #Gemma2 and #Phi2.

    src: developer.nvidia.com/blog/how-

    #AI #MachineLearning #NLP #LLM

  3. 🧠 #NVIDIA researchers are advancing SLMs through structured weight pruning and knowledge distillation, cutting down model size while preserving performance.

    For instance, #Minitron 8B and 4B models, derived from Nemotron 15B, outperform many models trained from scratch. Despite its size, it competes with top-tier models like #Gemma2 and #Phi2.

    src: developer.nvidia.com/blog/how-

    #AI #MachineLearning #NLP #LLM

  4. 🧠 #NVIDIA researchers are advancing SLMs through structured weight pruning and knowledge distillation, cutting down model size while preserving performance.

    For instance, #Minitron 8B and 4B models, derived from Nemotron 15B, outperform many models trained from scratch. Despite its size, it competes with top-tier models like #Gemma2 and #Phi2.

    src: developer.nvidia.com/blog/how-

    #AI #MachineLearning #NLP #LLM

  5. 🧠 #NVIDIA researchers are advancing SLMs through structured weight pruning and knowledge distillation, cutting down model size while preserving performance.

    For instance, #Minitron 8B and 4B models, derived from Nemotron 15B, outperform many models trained from scratch. Despite its size, it competes with top-tier models like #Gemma2 and #Phi2.

    src: developer.nvidia.com/blog/how-

    #AI #MachineLearning #NLP #LLM