#llmbenchmarking — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #llmbenchmarking, aggregated by home.social.

  1. Anthropic just rolled out Claude Code at $200/month, while the new Claude 4 climbs to the top of Berkeley’s tool-calling leaderboard, beating open-source rivals. Find out where Claude 4’s function calling shines and why Goose stays free. #Claude4 #FunctionCalling #BerkeleyLeaderboard #LLMBenchmarking

    🔗 aidailypost.com/news/claude-co