home.social

#openweight — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #openweight, aggregated by home.social.

  1. Moving away from expensive frontier models (#OpenAI, #Claude, #Gemini) to build a custom #openweight AI setup. My current workflow orchestrates #kimi k2.6, #deepSeek v4, and #glm using Oh My OpenAgent as the base.

    Read about my setup here: richardorilla.website/seting_u

    #development #aidev

  6. Mistral Medium 3.5 Developer Guide: API, Remote Agents & Pricing 2026

    Mistral Medium 3.5 is a 128B open-weight dense model with 77.6% SWE-Bench Verified and a 256K context window, released April 29, 2026 under a modified MIT license. It ships with...

    wowhow.cloud/blogs/mistral-med

    #wowhow #mistral #aimodels #openweight

  7. Today I coded for 4 hours with #OpenCode using GLM 5 instead of Claude Sonnet 4.6. The project is a #Deno app written in #typescript and the agent had to refactor code and HTML layout.

    I perceived no difference in capabilities. GLM 5 handled edits well, made some mistakes just like Sonnet, and produced output of the same quality.

    But GLM 5 is 3x smaller than Sonnet!

    Private AI models are wasteful. They're too big, use too much power, and are too expensive to run. #openweight is the future.

  8. #Mistral has built a $14B empire by focusing on #openweight #AI models, which prioritise data #sovereignty and #independence from American and Chinese tech giants. While Mistral’s models lag behind those of OpenAI and Anthropic in performance, its emphasis on #security and #localisation has resonated with European companies and governments. forbes.com/sites/iainmartin/20 #tech #media #news

  13. #Mistral has built a $14B empire by focusing on #openweight #AI models, which prioritise data #sovereignty and #independence from American and Chinese tech giants. While Mistral’s models lag behind those of OpenAI and Anthropic in performance, its emphasis on #security and #localisation has resonated with European companies and governments. forbes.com/sites/iainmartin/20 #AIagent #AI #ML #NLP #LLM #GenAI

  18. A small update on this: the #llm #MiniMax #M2.7 is reportedly going to be released as #OpenWeight soon. So my worries about that were, fortunately, unfounded.

  19. #Google releases #VaultGemma, its first #privacy-preserving #LLM
    #GoogleResearch shows that #AI models can keep training data private.
    This work on differential privacy has led to a new #openweight Google model called VaultGemma. The model uses differential privacy to reduce the possibility of memorization, which could change how Google builds privacy into its future AI agents. For now, though, the company's first differential privacy model is an experiment.
    arstechnica.com/ai/2025/09/goo

  20. The #US #AIActionPlan prioritises #opensource and #openweight #AI to compete with #China’s growing influence in the field. China’s open-source models, like #DeepSeek #R1, have gained widespread adoption, including in the US. The US, once a leader in open-source AI, is now at risk of falling behind if it doesn’t prioritise #openness and #collaboration. venturebeat.com/ai/why-open-so #tech #media #news

  21. @katzenberger @silentexception

    So, from that perspective, it is important that:
    1. We find the appropriate job/task for which the #LLM is indispensable, and not use it for everything
    2. We try to reuse the LLM as much as possible rather than training again and again

    #opensource #openweight #ai #jtbd #economics #foss #genai

  22. This Atlantic article shows very vividly how fragile the foundation of the #AI boom is.
    From a #cybersecurity perspective, I ask myself two questions:
    1. Despite the #digitalsovereignty debate, do we want to keep pushing the concentration on the hyperscalers, or do we actively shape vendor diversification?
    2. How resilient is my AI use case if LLM costs rise significantly? #OpenWeight and #Selfhosting are not niche solutions but sensible options in #TPRM.

    Here is the article by Matteo Wong and Charlie Warzel, #TheAtlantic
    theatlantic.com/technology/202

  23. Tech companies are taking you for fools here

    #OpenWeight is just #feeeware dressed up as an #LLM 🤷‍♀️

    It's not #OpenSource; it's reusable without paying for a license, that's all

  24. #Ai2 has released #MolmoWeb, an #openweight #visualwebagent that operates from browser screenshots and executes actions like clicking, typing, and scrolling. It comes with #MolmoWebMix, a dataset of 30,000 human task trajectories and 2.2 million screenshot question-answer pairs, making it the largest publicly released collection of human web-task execution. venturebeat.com/data/ai2-relea #tech #media #news

  25. The Chinese #llm #MiniMax M2.7 far surpasses its predecessor M2.5 and has reached the range of Anthropic's #Opus.
    Unfortunately, the release strategy now also seems to be shifting from #OpenWeight to proprietary.
    I'm curious to see how the Chinese competitors #Kimi, #GLM and #Qwen will behave in the future.
    I consider OpenWeight indispensable - the market may well think it can do without it.

  26. An open-weight, 380M-parameter, noninvasive thought-to-text model that works from reconstructed, denoised, and upsampled EEG data.

    zyphra.com/post/zuna

    #AI #AIResearch #OpenWeight

  27. Here is my own prediction (which some of you will actually have worked out long ago from reading my toots).

    There will be a massive exodus of the (true) old community from American #AI models in favor of #openweight models.

    Let me remind you that there is little hype for DeepSeek4… too early? Maybe… we'll see…

    #llm #deepseek

  28. Qwen3.5 (#OpenWeight) was released yesterday!

    github.com/QwenLM/Qwen3.5

    Authors claim better than GPT-5.2 "average ranking" (and close to Opus-4.5-Thinking):

    qwen.ai/blog?id=qwen3.5

    The first release is too large to run on most GPUs (397B total parameters and 17B active parameters, good for 8xH200), but "More sizes are coming".

    These models are improving at an impressive rate. That's where AI is improving the fastest (more than from hardware improvements).

    #AI #Qwen

  29. #SiliconValley #AI startups are seeing record valuations, but many are building on cheap, free-to-download AI models from #China
    “These models were not that far behind the frontier. In fact, they were surprisingly close to the frontier. The ones that are coming now, well they’re palpably close to the frontier.”
    Models like #DeepSeek’s #R1 and #Alibaba’s #Qwen are free to use & considered “#opensource” or “#openweight” because anyone can download, copy, modify & use them.
    nbcnews.com/tech/innovation/si

  34. Fal just dropped Flux 2 Turbo – an open‑weight model that’s 10× cheaper and 6× more efficient for AI image synthesis. Powered by Black Forest Labs, it runs on Nano Banana Pro and pairs with GPT‑Image 1.5. Curious how this changes the open‑source scene? Read the full breakdown! #Fal #Flux2Turbo #OpenWeight #AIImageSynthesis

    🔗 aidailypost.com/news/fal-relea

  35. "There are a number of diffusion models out there, but I have tended to use Midjourney, which has been around longer than many other AI tools. Using Midjourney allows us to see how diffusion models have developed over time, as you can see with the simple prompt “otter on a plane using wifi” (for every image and video in this post, I pick the best out of the first four images generated). We go from melted fur at the start of 2022 to a visible otter (with too many fingers and a weird keyboard) at the end of that year. In 2023, we get a photorealistic otter, but still a weird keyboard and plane windows. In 2024, the lighting and positioning become better, and by 2025 we have excellent photorealism.

    But what makes diffusion models interesting is not their increasing ability to make photorealistic images, but rather the fact that they can create images in various styles. This cuts to the heart of why AI image generation is so controversial, as many AI models are trained on images from throughout the web, including copyrighted work, and can thus replicate images in the style of living artists without their permission or compensation."

    oneusefulthing.org/p/the-recen

    #AU #GenerativeAI #GeneratedImages #DiffusionModels #OpenWeight