#humanityslastexam — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #humanityslastexam, aggregated by home.social.

AI Daily Post @[email protected] · 2025-12-18 · 13:42 UTC

Google's new Gemini 3 Flash promises faster AI with a leaner footprint, challenging larger frontier models. Early benchmark tests show it hitting 2.5‑model performance levels while staying more accessible. Could this be the answer to the ‘Humanity’s Last Exam’ of scaling? Dive into the details and see how parameter counts stack up. #Gemini3Flash #FrontierModels #2point5Models #HumanitysLastExam
🔗 https://aidailypost.com/news/gemini-3-flash-debuts-delivering-faster-ai-while-rivaling-larger

AI Daily Post @[email protected] · 2025-12-11 · 18:42 UTC

Gemini Deep Research agent just topped the Humanity’s Last Exam (HLE) and DeepSearchQA benchmarks, and now leads BrowseComp—outperforming Google Search and NotebookLM. The results showcase a new AI model’s capabilities and set a fresh standard for open‑source research tools. Curious how it did it? Read the full breakdown. #GeminiDeepResearch #HumanitysLastExam #DeepSearchQA #BrowseComp
🔗 https://aidailypost.com/news/gemini-deep-research-agent-posts-top-results-hle-deepsearchqa-leads

IPN Kiel @[email protected] · 2025-01-24 · 13:52 UTC

Testing the limits of AI
Reuters and the New York Times report on a new test: Humanity's Last Exam. With 3,000 questions from more than 100 subject areas, it probes the limits of modern AI systems. Thorben Jansen of IPN was involved in its development.
🔗 More: https://lastexam.ai
New York Times: https://www.reuters.com/technology/artificial-intelligence/ai-experts-ready-humanitys-last-exam-stump-powerful-tech-2024-09-16/
Reuters: https://www.reuters.com/technology/artificial-intelligence/ai-experts-ready-humanitys-last-exam-stump-powerful-tech-2024-09-16/
#AI #AIBenchmark #KI #HumanitysLastExam