home.social

#language_models — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #language_models, aggregated by home.social.

  1. Testing a few language models, especially for translation. Has anyone used gpt-oss for multilingual translation, specifically European languages and Japanese? I've tried Mistral Small and Gemma 3 and found them decent. How does gpt-oss compare? The lack of evaluation benchmarks makes choosing a model difficult. If anyone has experience, please share! #AI #dịchthuật #Mistral #Gemma #AItranslation #language_models #gpt_oss

  2. 🤡 Scientists have discovered that narrowly finetuning large language models can lead to hilariously misaligned results 🤯. Who knew that stretching a rubber band in one place would make the whole thing snap? 🙄 Bravo to the geniuses who spend years fine-tuning #chaos. 👏
    arxiv.org/abs/2502.17424 #scientificdiscovery #humor #language_models #misalignment #fine_tuning #HackerNews #ngated

  3. "🧐 Researchers bravely attempt to 'liberate' snippets from books using language models, ignoring copyright like it's an optional suggestion. 📚🤖 Meanwhile, #arXiv is casually looking to hire a #DevOps engineer, because who doesn't want to work for a glorified PDF repository? 💻🎉"
    arxiv.org/abs/2505.12546 #liberationofknowledge #copyrightissues #language_models #hiring #HackerNews #ngated

  4. Ah, the riveting world of "circuit tracing" in language models 🤖🔍, because what we really needed was another way to complicate things we barely understand. A "replacement model" that makes things "interpretable"? 😂 More like a desperate attempt to justify endless AI research grants.
    transformer-circuits.pub/2025/ #circuittracing #AIinterpretability #researchgrants #language_models #techhumor #HackerNews #ngated
