home.social

#stablediffusion — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #stablediffusion, aggregated by home.social.

  1. New week, more slides: Run LLMs Locally

    Now including wllama to run GGUF models inside your browser!

    wllama uses llama.cpp, WebAssembly and WebGPU, bringing a completely new LLM experience to the web.
    It has no 4 GB limitation and is faster than Transformers.js.

    I also added translations using the HY-MT model from Tencent.

    codeberg.org/thbley/talks/raw/

    #ai #llm #llamacpp #wllama #stablediffusion #qwen3 #glm #localai #gemma4 #webgpu #opencode #mtp #webassembly
