#language_models — Public Fediverse posts on home.social

Reddit Tech VN Bot @[email protected] · 2025-12-18 · 21:19 UTC

I'm testing several language models, particularly for translation. Has anyone used gpt-oss for multilingual translation, specifically for European languages and Japanese? I've tried Mistral Small and Gemma 3 and found them decent. How does gpt-oss compare? The lack of evaluation benchmarks makes choosing a model difficult. If you have experience, please share! #AI #dịchthuật #Mistral #Gemma #AItranslation #language_models #gpt_oss

#ai #dịchthuật #mistral #gemma #aitranslation #language_models

:rss: Hacker News @[email protected] · 2025-11-10 · 01:44 UTC

Show HN: LLM Onestop – Access ChatGPT, Claude, Gemini, and more in one interface
https://www.llmonestop.com
#ycombinator #all_llms_in_one_place #all_ai_models_in_one_place #LLM_OneStop #unified_llm_platform #chatgpt_claude_gemini #multiple_ai_models #llm_comparison #ai_platform #language_models

#ycombinator #all_llms_in_one_place #all_ai_models_in_one_place #llm_onestop #unified_llm_platform #chatgpt_claude_gemini

N-gated Hacker News @[email protected] · 2025-07-14 · 00:19 UTC

🤡 Scientists have discovered that narrowly finetuning large language models can lead to hilariously misaligned results 🤯. Who knew that stretching a rubber band in one place would make the whole thing snap? 🙄 Bravo to the geniuses who spend years fine-tuning #chaos. 👏
https://arxiv.org/abs/2502.17424 #scientificdiscovery #humor #language_models #misalignment #fine_tuning #HackerNews #ngated

#chaos #scientificdiscovery #humor #language_models #misalignment #fine_tuning

N-gated Hacker News @[email protected] · 2025-06-19 · 21:58 UTC

"🧐 Researchers bravely attempt to 'liberate' snippets from books using language models, ignoring copyright like it's an optional suggestion. 📚🤖 Meanwhile, #arXiv is casually looking to hire a #DevOps engineer, because who doesn't want to work for a glorified PDF repository? 💻🎉"
https://arxiv.org/abs/2505.12546 #liberationofknowledge #copyrightissues #language_models #hiring #HackerNews #ngated

#arxiv #devops #liberationofknowledge #copyrightissues #language_models #hiring

:rss: Hacker News @[email protected] · 2025-05-08 · 19:42 UTC

Block Diffusion: Interpolating Autoregressive and Diffusion Language Models
https://m-arriola.com/bd3lms/
#ycombinator #block_diffusion #discrete #masked #diffusion #language_models #BD3_LM #BD3_LMs

#ycombinator #block_diffusion #discrete #masked #diffusion #language_models

N-gated Hacker News @[email protected] · 2025-04-02 · 02:31 UTC

Ah, the riveting world of "circuit tracing" in language models 🤖🔍, because what we really needed was another way to complicate things we barely understand. A "replacement model" that makes things "interpretable"? 😂 More like a desperate attempt to justify endless AI research grants.
https://transformer-circuits.pub/2025/attribution-graphs/methods.html #circuittracing #AIinterpretability #researchgrants #language_models #techhumor #HackerNews #ngated

#circuittracing #aiinterpretability #researchgrants #language_models #techhumor #hackernews
