home.social

#llmd — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #llmd, aggregated by home.social.

  1. Red Hat and Tesla engineers tackled a real production problem together.

    3x output tokens/sec, 2x faster TTFT on Llama 3.1 70B with KServe + llm-d + vLLM. Fixes pushed upstream to KServe along the way.

    This is what open source looks like. 🤝 🚀

    llm-d.ai/blog/production-grade

    #RedHat #Tesla #RedHatAI #vLLM #Pytorch #Kubernetes #OpenShift #KServe #llmd #Llama #OpenSource

  2. Big thanks to everyone contributing code, reviews, and ideas — this integration is shaping up to be a game-changer for Kubernetes-native LLM serving. Stay tuned for the next release!

  3. 🎉 Behold! The #llmd #community emerges from the depths of the #tech abyss, promising the holy grail of Kubernetes-native distributed #LLM #inference. 🤖 Because who doesn't want their #AI #deployments served with extra buzzwords and a side of "competitive performance per dollar"? 🍽️
    llm-d.ai/blog/llm-d-announce #Kubernetes #news #HackerNews #ngated