Adam :redhat: :ansible: :bash:

View on fosstodon.org
  1. Red Hat and Tesla engineers tackled a real production problem together.

    3x output tokens/sec, 2x faster TTFT on Llama 3.1 70B with KServe + llm-d + vLLM. Fixes pushed upstream to KServe along the way.

    This is what open source looks like. 🤝 🚀

    llm-d.ai/blog/production-grade

  2. Achieve better large language model inference with fewer GPUs

    "we achieved approximately 55-65% of the throughput on a server config that is approximately 15% of the cost"

    redhat.com/en/blog/achieve-bet

  3. I'm just going to say it, and we can agree to disagree if you do in fact disagree...

    systemd has categorically made Linux better in basically every way imaginable

    It's earnestly cool if you don't agree but it's really really good

    🤷