#language_models — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #language_models, aggregated by home.social.
-
Đang thử nghiệm một số mô hình ngôn ngữ, đặc biệt là dịch thuật. Có ai dùng qua gpt-oss cho dịch đa ngôn ngữ, cụ thể là tiếng châu Âu và tiếng Nhật chưa? Đã thử Mistral Small và Gemma 3, thấy ổn. Gpt-oss so ra sao? Thiếu tiêu chuẩn đánh giá khiến việc lựa chọn mô hình khó khăn. Ai có kinh nghiệm chia sẻ giúp! #AI #dịchthuật #Mistral #Gemma #AItranslation #language_models #gpt_oss
(NOTE: Post is in Vietnamese, under 500字符, includes both English & Vietnamese tags, no URLs. Original content is a
-
Show HN: LLM Onestop – Access ChatGPT, Claude, Gemini, and more in one interface
https://www.llmonestop.com
#ycombinator #all_llms_in_one_place #all_ai_models_in_one_place #LLM_OneStop #unified_llm_platform #chatgpt_claude_gemini #multiple_ai_models #llm_comparison #ai_platform #language_models -
🤡 Scientists have discovered that narrowly finetuning large language models can lead to hilariously misaligned results 🤯. Who knew that stretching a rubber band in one place would make the whole thing snap? 🙄 Bravo to the geniuses who spend years fine-tuning #chaos. 👏
https://arxiv.org/abs/2502.17424 #scientificdiscovery #humor #language_models #misalignment #fine_tuning #HackerNews #ngated -
"🧐 Researchers bravely attempt to 'liberate' snippets from books using language models, ignoring copyright like it's an optional suggestion. 📚🤖 Meanwhile, #arXiv is casually looking to hire a #DevOps engineer, because who doesn't want to work for a glorified PDF repository? 💻🎉"
https://arxiv.org/abs/2505.12546 #liberationofknowledge #copyrightissues #language_models #hiring #HackerNews #ngated -
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models
https://m-arriola.com/bd3lms/
#ycombinator #block_diffusion #discrete #masked #diffusion #language_models #BD3_LM #BD3_LMs -
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models
https://m-arriola.com/bd3lms/
#ycombinator #block_diffusion #discrete #masked #diffusion #language_models #BD3_LM #BD3_LMs -
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models
https://m-arriola.com/bd3lms/
#ycombinator #block_diffusion #discrete #masked #diffusion #language_models #BD3_LM #BD3_LMs -
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models
https://m-arriola.com/bd3lms/
#ycombinator #block_diffusion #discrete #masked #diffusion #language_models #BD3_LM #BD3_LMs -
Ah, the riveting world of "circuit tracing" in language models 🤖🔍, because what we really needed was another way to complicate things we barely understand. A "replacement model" that makes things "interpretable"? 😂 More like a desperate attempt to justify endless AI research grants.
https://transformer-circuits.pub/2025/attribution-graphs/methods.html #circuittracing #AIinterpretability #researchgrants #language_models #techhumor #HackerNews #ngated -
Ah, the riveting world of "circuit tracing" in language models 🤖🔍, because what we really needed was another way to complicate things we barely understand. A "replacement model" that makes things "interpretable"? 😂 More like a desperate attempt to justify endless AI research grants.
https://transformer-circuits.pub/2025/attribution-graphs/methods.html #circuittracing #AIinterpretability #researchgrants #language_models #techhumor #HackerNews #ngated -
Ah, the riveting world of "circuit tracing" in language models 🤖🔍, because what we really needed was another way to complicate things we barely understand. A "replacement model" that makes things "interpretable"? 😂 More like a desperate attempt to justify endless AI research grants.
https://transformer-circuits.pub/2025/attribution-graphs/methods.html #circuittracing #AIinterpretability #researchgrants #language_models #techhumor #HackerNews #ngated -
Ah, the riveting world of "circuit tracing" in language models 🤖🔍, because what we really needed was another way to complicate things we barely understand. A "replacement model" that makes things "interpretable"? 😂 More like a desperate attempt to justify endless AI research grants.
https://transformer-circuits.pub/2025/attribution-graphs/methods.html #circuittracing #AIinterpretability #researchgrants #language_models #techhumor #HackerNews #ngated -
DeepSeek's R1 AI Model Faces Criticism Over Security Vulnerabilities - https://www.redpacketsecurity.com/deepseek-s-flagship-ai-model-under-fire-for-security-vulnerabilities/
-
ChatGPT Learned to Reason [video]
https://www.youtube.com/watch?v=PvDaPeQjxOE
#ycombinator #AI_reasoning #ChatGPT_explained #artificial_intelligence #neural_networks #Monte_Carlo_Tree_Search #DeepMind #AlphaGo #chess_AI #language_models #machine_learning #reinforcement_learning #deep_learning #AI_history #GPT_training #chain_of_thought #AI_breakthrough #game_AI #TD_Gammon #MuZero #Claude_AI #O1_AI #AI_algorithms #AI_development #computer_reasoning #AI_evolution #future_AI -
ChatGPT Learned to Reason [video]
https://www.youtube.com/watch?v=PvDaPeQjxOE
#ycombinator #AI_reasoning #ChatGPT_explained #artificial_intelligence #neural_networks #Monte_Carlo_Tree_Search #DeepMind #AlphaGo #chess_AI #language_models #machine_learning #reinforcement_learning #deep_learning #AI_history #GPT_training #chain_of_thought #AI_breakthrough #game_AI #TD_Gammon #MuZero #Claude_AI #O1_AI #AI_algorithms #AI_development #computer_reasoning #AI_evolution #future_AI -
ChatGPT Learned to Reason [video]
https://www.youtube.com/watch?v=PvDaPeQjxOE
#ycombinator #AI_reasoning #ChatGPT_explained #artificial_intelligence #neural_networks #Monte_Carlo_Tree_Search #DeepMind #AlphaGo #chess_AI #language_models #machine_learning #reinforcement_learning #deep_learning #AI_history #GPT_training #chain_of_thought #AI_breakthrough #game_AI #TD_Gammon #MuZero #Claude_AI #O1_AI #AI_algorithms #AI_development #computer_reasoning #AI_evolution #future_AI -
ChatGPT Learned to Reason [video]
https://www.youtube.com/watch?v=PvDaPeQjxOE
#ycombinator #AI_reasoning #ChatGPT_explained #artificial_intelligence #neural_networks #Monte_Carlo_Tree_Search #DeepMind #AlphaGo #chess_AI #language_models #machine_learning #reinforcement_learning #deep_learning #AI_history #GPT_training #chain_of_thought #AI_breakthrough #game_AI #TD_Gammon #MuZero #Claude_AI #O1_AI #AI_algorithms #AI_development #computer_reasoning #AI_evolution #future_AI -
Academics Develop Testing Benchmark for LLMs in Cyber Threat Intelligence - https://www.redpacketsecurity.com/academics-develop-testing-benchmark-for-llms-in-cyber-threat-intelligence/
#threatintel #cyber_security #artificial_intelligence #language_models
-
Kolmogorov-Arnold Networks: MLP vs. Kan, Math, Universal Approximation Theorem [video]
https://www.youtube.com/watch?v=-PFIkkwWdnM
#ycombinator #pytorch #python #tutorial #math #language_models #deep_learning #machine_learning #multi_layer_perceptron #mlp #kolmogorov_arnold_networks #kolmogorov_arnold_representation_theorem #universal_approximation_theorem #neural_networks #bezier_curves #splines #b_splines #linear_layers -
Kolmogorov-Arnold Networks: MLP vs. Kan, Math, Universal Approximation Theorem [video]
https://www.youtube.com/watch?v=-PFIkkwWdnM
#ycombinator #pytorch #python #tutorial #math #language_models #deep_learning #machine_learning #multi_layer_perceptron #mlp #kolmogorov_arnold_networks #kolmogorov_arnold_representation_theorem #universal_approximation_theorem #neural_networks #bezier_curves #splines #b_splines #linear_layers -
Kolmogorov-Arnold Networks: MLP vs. Kan, Math, Universal Approximation Theorem [video]
https://www.youtube.com/watch?v=-PFIkkwWdnM
#ycombinator #pytorch #python #tutorial #math #language_models #deep_learning #machine_learning #multi_layer_perceptron #mlp #kolmogorov_arnold_networks #kolmogorov_arnold_representation_theorem #universal_approximation_theorem #neural_networks #bezier_curves #splines #b_splines #linear_layers -
Kolmogorov-Arnold Networks: MLP vs. Kan, Math, Universal Approximation Theorem [video]
https://www.youtube.com/watch?v=-PFIkkwWdnM
#ycombinator #pytorch #python #tutorial #math #language_models #deep_learning #machine_learning #multi_layer_perceptron #mlp #kolmogorov_arnold_networks #kolmogorov_arnold_representation_theorem #universal_approximation_theorem #neural_networks #bezier_curves #splines #b_splines #linear_layers