home.social

#openweightmodel — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #openweightmodel, aggregated by home.social.

  1. DeepSeek-V4: Low-Cost Logic via Hybrid Attention Architectures. Presentational View. Introduction: There is an evident inclination toward novel innovations that incorporate unique structural modificat...

    #open-weight-model #agentic-search #deepseek-v4 #hybrid-attention #coding-agents

  2. #DeepSeek’s #R1 #AImodel, a cheaper rival to US tools, excels at reasoning tasks and is the most popular #openweightmodel on Hugging Face. The model, trained for US$294,000 using Nvidia’s H800 chips, employs pure reinforcement learning, rewarding correct answers rather than following human examples. This approach, detailed in a peer-reviewed Nature paper, has influenced AI research in 2025. nature.com/articles/d41586-025 #tech #media #news