home.social

#mmmlu — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #mmmlu, aggregated by home.social.

  1. New benchmark results show Mistral Large 3 outshining rivals across LMArena, MMMLU, AIME25 and GPQA Diamond tests. This open‑source LLM delivers top‑tier performance while staying community‑driven. Dive into the full analysis to see how it stacks up against Qwen‑14B and others. #MistralLarge3 #OpenSourceLLM #MMMLU #GPQADiamond

    🔗 aidailypost.com/news/mistral-l