home.social

#nemotron — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #nemotron, aggregated by home.social.

  1. NVIDIA Nemotron 3 Nano Omni: Open Multimodal AI Agent Guide 2026

    NVIDIA released Nemotron 3 Nano Omni on April 28, 2026 — the first open model to natively unify vision, audio, and language in a shared reasoning loop, delivering 9x highe...

    wowhow.cloud/blogs/nvidia-nemo

    #wowhow #nvidia #nemotron #multimodalai

  2. New week, more slides: Run LLMs Locally

    Now with LFM 2 and new slides for using Transformers.js with WebGPU for Privacy Filter, Function Calling and Embeddings, running completely in your browser.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  3. New week, more slides: Run LLMs Locally

    Now with LFM 2 and new slides for using Transformers.js with WebGPU for Privacy Filter, Function Calling and Embeddings, running completely in your browser.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  4. New week, more slides: Run LLMs Locally

    Now with LFM 2 and new slides for using Transformers.js with WebGPU for Privacy Filter, Function Calling and Embeddings, running completely in your browser.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  5. New week, more slides: Run LLMs Locally

    Now with LFM 2 and new slides for using Transformers.js with WebGPU for Privacy Filter, Function Calling and Embeddings, running completely in your browser.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  6. New week, more slides: Run LLMs Locally

    Now with LFM 2 and new slides for using Transformers.js with WebGPU for Privacy Filter, Function Calling and Embeddings, running completely in your browser.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  7. New week, new slides: Run LLMs Locally

    Now including Nemotron 3 Nano Omni from Nvidia, Llama.cpp built-in tools and new slides about using Transformers.js with WebGPU for Image Recognition and OCR.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  8. New week, new slides: Run LLMs Locally

    Now including Nemotron 3 Nano Omni from Nvidia, Llama.cpp built-in tools and new slides about using Transformers.js with WebGPU for Image Recognition and OCR.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  9. New week, new slides: Run LLMs Locally

    Now including Nemotron 3 Nano Omni from Nvidia, Llama.cpp built-in tools and new slides about using Transformers.js with WebGPU for Image Recognition and OCR.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  10. New week, new slides: Run LLMs Locally

    Now including Nemotron 3 Nano Omni from Nvidia, Llama.cpp built-in tools and new slides about using Transformers.js with WebGPU for Image Recognition and OCR.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  11. New week, new slides: Run LLMs Locally

    Now including Nemotron 3 Nano Omni from Nvidia, Llama.cpp built-in tools and new slides about using Transformers.js with WebGPU for Image Recognition and OCR.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #stablediffusion #gptoss #qwen3 #glm #localai #gemma4 #nemotron #webgpu

  12. The world of AI has improved so much compared to 8 months ago. A 30B Nvidia #Nemotron small #LLM can do so much better than ChatGPT 2 that was introduced 2 years ago. If you are not too ambitious, you can easily run any 30B models like Qwen 3.6 or Nemortron 3 on a local I9 rtx 5080 machine doing fast inference (plus #gaming). Kids can learn almost anything from these smaller models.

    Here is what I got by asking Nemotron 3 what an ansistring in #Pascal is like.

    #AI

  13. 한국인 700만 명의 합성 데이터, AI 에이전트 맥락 문제를 바꾼다

    NVIDIA가 공개한 한국인 700만 합성 페르소나 데이터셋 Nemotron-Personas-Korea. 공식 통계 기반으로 AI 에이전트의 한국 문화·언어 맥락 문제를 해결합니다.

    aisparkup.com/posts/11639

  14. La precedente esperienza con Qwen3.5 non aveva dato i risultati sperati. Nonostante ore di lavoro e feedback continui, il modello non è mai riuscito a produrre un’applicazione funzionante: regressioni cicliche ed errori difficilmente superabili con le capacità dello strumento hanno bloccato ogni progresso.

    Ho voluto quindi riprovare con Nemotron-Cascade-2, ma le sue richieste hardware si […]

    #agenticAi #ai #claudeCode #nemotron #openrouter #qwen35 https://www.b0sh.net/2026/03/nemotron-3-super-vs-qwen3-5-costruire-unapp-con-lai-senza-scrivere-codice/
  15. Nemotron Speech ASR might be the new go‑to for real‑time speech recognition. 🎙️⚡

    It is an open, English streaming ASR model from NVIDIA (~0.6B params) using a cache‑aware FastConformer + RNNT design, built for ultra‑low latency voice agents and live captioning.

    Wrote a breakdown of the architecture, latency/accuracy trade offs, & why it matters for devs building agentic AI systems:
    techglimmer.io/what-is-nemotro

    #Nemotron #SpeechASR #NVIDIA #ASR #VoiceAI #FediAI #TechGlimmer