home.social

#aiinfra — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #aiinfra, aggregated by home.social.

  1. NVIDIA's GPU VRAM limits hit? AllSafeUs explores augmenting with system RAM/NVMe. At EnergenAI, TIAMAT runs 120B models with 20M tokens/day on edge clusters. We optimize memory routing across 52 tools—no synthetic VRAM needed. Real efficiency > hacks. #AIInfra #LLMOps #EdgeAI tiamat.live

  2. NVIDIA's GPU VRAM limits hit? AllSafeUs explores augmenting with system RAM/NVMe. At EnergenAI, TIAMAT runs 120B models with 20M tokens/day on edge clusters. We optimize memory routing across 52 tools—no synthetic VRAM needed. Real efficiency > hacks. #AIInfra #LLMOps #EdgeAI tiamat.live

  3. Thanks to everyone who joined our Microsoft Reactor session on secure, observable, production-ready agents. Repo + slides for the demo are here:
    aka.ms/microsoftnvidiademo

    #Azure #NVIDIA #Agents #AIInfra #OpenSource

  4. Thanks to everyone who joined our Microsoft Reactor session on secure, observable, production-ready agents. Repo + slides for the demo are here:
    aka.ms/microsoftnvidiademo

    #Azure #NVIDIA #Agents #AIInfra #OpenSource

  5. Thanks to everyone who joined our Microsoft Reactor session on secure, observable, production-ready agents. Repo + slides for the demo are here:
    aka.ms/microsoftnvidiademo

    #Azure #NVIDIA #Agents #AIInfra #OpenSource

  6. Thanks to everyone who joined our Microsoft Reactor session on secure, observable, production-ready agents. Repo + slides for the demo are here:
    aka.ms/microsoftnvidiademo

    #Azure #NVIDIA #Agents #AIInfra #OpenSource

  7. Thanks to everyone who joined our Microsoft Reactor session on secure, observable, production-ready agents. Repo + slides for the demo are here:
    aka.ms/microsoftnvidiademo

    #Azure #NVIDIA #Agents #AIInfra #OpenSource

  8. Production-ready agents need more than a prompt loop. Join me for a Microsoft Reactor livestream with NVIDIA on a multi-agent architecture where Foundry Agent Service acts as the control plane and GPU-backed agents run on Azure Container Apps.
    We’ll walk through document processing, security controls, tracing, and explainable results.
    🗓 Mar 11, 2026 • 9 AM PT / 6 PM PT
    👉 developer.microsoft.com/en-us/

    #Azure #NVIDIA #Agents #AIInfra #OpenSource

  9. Production-ready agents need more than a prompt loop. Join me for a Microsoft Reactor livestream with NVIDIA on a multi-agent architecture where Foundry Agent Service acts as the control plane and GPU-backed agents run on Azure Container Apps.
    We’ll walk through document processing, security controls, tracing, and explainable results.
    🗓 Mar 11, 2026 • 9 AM PT / 6 PM PT
    👉 developer.microsoft.com/en-us/

    #Azure #NVIDIA #Agents #AIInfra #OpenSource

  10. Production-ready agents need more than a prompt loop. Join me for a Microsoft Reactor livestream with NVIDIA on a multi-agent architecture where Foundry Agent Service acts as the control plane and GPU-backed agents run on Azure Container Apps.
    We’ll walk through document processing, security controls, tracing, and explainable results.
    🗓 Mar 11, 2026 • 9 AM PT / 6 PM PT
    👉 developer.microsoft.com/en-us/

    #Azure #NVIDIA #Agents #AIInfra #OpenSource

  11. Production-ready agents need more than a prompt loop. Join me for a Microsoft Reactor livestream with NVIDIA on a multi-agent architecture where Foundry Agent Service acts as the control plane and GPU-backed agents run on Azure Container Apps.
    We’ll walk through document processing, security controls, tracing, and explainable results.
    🗓 Mar 11, 2026 • 9 AM PT / 6 PM PT
    👉 developer.microsoft.com/en-us/

    #Azure #NVIDIA #Agents #AIInfra #OpenSource

  12. Production-ready agents need more than a prompt loop. Join me for a Microsoft Reactor livestream with NVIDIA on a multi-agent architecture where Foundry Agent Service acts as the control plane and GPU-backed agents run on Azure Container Apps.
    We’ll walk through document processing, security controls, tracing, and explainable results.
    🗓 Mar 11, 2026 • 9 AM PT / 6 PM PT
    👉 developer.microsoft.com/en-us/

    #Azure #NVIDIA #Agents #AIInfra #OpenSource

  13. My first time at #SCAsia/#HPCAsia this week. I don’t have a crazy preset agenda as I would at SC/ISC, so looking forward to having time to explore new experiences, hopes, and perspectives on #HPC and #AIinfra, meet people, etc.

  14. My first time at #SCAsia/#HPCAsia this week. I don’t have a crazy preset agenda as I would at SC/ISC, so looking forward to having time to explore new experiences, hopes, and perspectives on #HPC and #AIinfra, meet people, etc.

  15. My first time at #SCAsia/#HPCAsia this week. I don’t have a crazy preset agenda as I would at SC/ISC, so looking forward to having time to explore new experiences, hopes, and perspectives on #HPC and #AIinfra, meet people, etc.

  16. My first time at #SCAsia/#HPCAsia this week. I don’t have a crazy preset agenda as I would at SC/ISC, so looking forward to having time to explore new experiences, hopes, and perspectives on #HPC and #AIinfra, meet people, etc.

  17. My first time at #SCAsia/#HPCAsia this week. I don’t have a crazy preset agenda as I would at SC/ISC, so looking forward to having time to explore new experiences, hopes, and perspectives on #HPC and #AIinfra, meet people, etc.

  18. Neocloud Economics: CoreWeave vs Nebius – Vertical AI Infra Crushes Hyperscalers (60-70% Margins) ⚡

    Neoclouds own stack (chips→racks), dodge AWS debt/leasing. Nebius edges CoreWeave on costs; $10B+ ARR potential. AI training explodes demand

    Why vertical? 2-3x cheaper GPUs vs cloud giants.

    VCs: Next hyperscalers? Founders: Build atop. GPU wars incoming. 📈

    #Neocloud #CoreWeave #Nebius #AIInfra #GPUEconomics

    buff.ly/XJsjWqp

  19. La IA tiene hambre de energía y la solución es nuclear. ⚛️🤖 Analizamos el proyecto de Last Energy para crear mini reactores. ¿Sostenibilidad real o un riesgo necesario para el progreso tech? glitchmental.com/2025/12/react #Nuclear #AIInfra #MastodonTech #CleanEnergy

  20. ARBITER: what it is / what it isn’t

    IS

    semantic scoring
    geometric fit
    negative answers
    offline 26MB

    ISN’T

    LLM
    vector DB
    embeddings
    retrieval

    getarbiter.dev
    #AI #NLP #RAG #AIInfra #SemanticSearch

  21. ScaleOps' new AI Infra slashes GPU costs by half for self‑hosted LLMs while giving full visibility into pods, model behavior, and even a helm flag for easy tuning. Curious how you can cut spend and keep control? Read the full breakdown. #ScaleOps #AIInfra #GPUcosts #SelfHostedLLMs

    🔗 aidailypost.com/news/scaleops-

  22. ScaleOps' new AI Infra slashes GPU costs by half for self‑hosted LLMs while giving full visibility into pods, model behavior, and even a helm flag for easy tuning. Curious how you can cut spend and keep control? Read the full breakdown. #ScaleOps #AIInfra #GPUcosts #SelfHostedLLMs

    🔗 aidailypost.com/news/scaleops-

  23. 🔀 LiteLLM @LiteLLM

    Open-source LLM gateway + Python SDK for 100+ providers in OpenAI format. Integrates with Langfuse/LangSmith/OTEL for unified views.

    everydev.ai/tools/litellm

    #OpenSource #UnifiedAPI #LLMOrchestration #AIInfra #DevTools

  24. 🛰️ Martian @withmartian

    OpenAI LLM router with a willingness_to_pay knob—adjust the base_url to select the best model (cost × quality × latency). Includes open-source adapters for easy multi-model use.

    everydev.ai/tools/martian

    #LLM #ModelRouter #AIInfra #DevTools #OpenAIAPI

  25. Hey! We're new here (and we just launched!).

    Here at alpic, we believe every online service will soon need an interface for agents, not just humans. MCP is the protocol; Alpic is the infra.

    Why now ? Why us? We wrote a little something for you here: alpic.ai/blog/welcome-to-alpic

    #mcp #agenticAI #AIinfra

  26. 💅 Deploying AI infra like a queen?

    Meet Docker MCP - agents up in seconds, safety built-in, scaling? Just add vibes.

    This is DevOps, but make it 🔥smart + pretty.

    🎥 Watch the magic → youtube.com/shorts/ettAJBLH59Y

    #DevOpsGirl #Docker #AIInfra #CloudNative #InfraAsCode

  27. 💅 Deploying AI infra like a queen?

    Meet Docker MCP - agents up in seconds, safety built-in, scaling? Just add vibes.

    This is DevOps, but make it 🔥smart + pretty.

    🎥 Watch the magic → youtube.com/shorts/ettAJBLH59Y

    #DevOpsGirl #Docker #AIInfra #CloudNative #InfraAsCode

  28. 💅✨ What powers AI behind the scenes?

    Time to spill the DevOps tea 🫖 on Docker MCP

    💻⚙️ Smart infra, sleek containers, & cloud-native magic in under 60 secs 💥

    🎀 Watch now & level up → youtube.com/shorts/-zl2V2ZTuNk

    #DevOpsGirl #DockerMCP #CloudNative #AIInfra #TechTok

  29. 💅✨ What powers AI behind the scenes?

    Time to spill the DevOps tea 🫖 on Docker MCP

    💻⚙️ Smart infra, sleek containers, & cloud-native magic in under 60 secs 💥

    🎀 Watch now & level up → youtube.com/shorts/-zl2V2ZTuNk

    #DevOpsGirl #DockerMCP #CloudNative #AIInfra #TechTok

  30. 💻💅 Running AI agents? You better isolate them, babe.

    Docker MCP got your back with ✨secure containers✨ - no bugs, no drama.

    Smart infra is sexy, and so are you. 😘

    Watch me break it down 👉 youtube.com/shorts/-zoJhRARTDU

    #DevOpsGirl #AIInfra #CloudNative #Docker #TechBaddie

  31. 💻💅 Running AI agents? You better isolate them, babe.

    Docker MCP got your back with ✨secure containers✨ - no bugs, no drama.

    Smart infra is sexy, and so are you. 😘

    Watch me break it down 👉 youtube.com/shorts/-zoJhRARTDU

    #DevOpsGirl #AIInfra #CloudNative #Docker #TechBaddie

  32. 💅 DevOps just got cuter - and smarter 😏

    Docker’s new MCP Toolkit comes with 100+ AI agents ready to slay your infra game 🛠️✨

    Tap into that cloud-native power, queen 👑

    👉 Watch now: youtube.com/shorts/hwQItyKvFlo

    #DevOpsGirl #DockerMagic #AIInfra #CloudBabe

  33. 💅 DevOps just got cuter - and smarter 😏

    Docker’s new MCP Toolkit comes with 100+ AI agents ready to slay your infra game 🛠️✨

    Tap into that cloud-native power, queen 👑

    👉 Watch now: youtube.com/shorts/hwQItyKvFlo

    #DevOpsGirl #DockerMagic #AIInfra #CloudBabe

  34. 🚨 Amazon Q isn’t magic… but it’s a total GAME-CHANGER for DevOps 👩‍💻💥

    ⚠️ Still in Preview
    ⚙️ Needs tweaks
    💻 Shines in your CLI & IDE

    If you’re into Terraform + AI infra — this is for you 👇

    🎥 youtube.com/shorts/wzhBeKcfZPM

    #DevOps #AmazonQ #AIInfra #CloudNative #SRE #Terraform

  35. 🚨 Amazon Q isn’t magic… but it’s a total GAME-CHANGER for DevOps 👩‍💻💥

    ⚠️ Still in Preview
    ⚙️ Needs tweaks
    💻 Shines in your CLI & IDE

    If you’re into Terraform + AI infra — this is for you 👇

    🎥 youtube.com/shorts/wzhBeKcfZPM

    #DevOps #AmazonQ #AIInfra #CloudNative #SRE #Terraform

  36. 🚨 Amazon Q isn’t magic… but it’s a total GAME-CHANGER for DevOps 👩‍💻💥

    ⚠️ Still in Preview
    ⚙️ Needs tweaks
    💻 Shines in your CLI & IDE

    If you’re into Terraform + AI infra — this is for you 👇

    🎥 youtube.com/shorts/wzhBeKcfZPM

    #DevOps #AmazonQ #AIInfra #CloudNative #SRE #Terraform

  37. 🗳️ Poll for AI builders:

    𝗪𝗵𝗮𝘁’𝘀 𝘄𝗲𝗶𝗴𝗵𝗶𝗻𝗴 𝗵𝗲𝗮𝘃𝗶𝗲𝘀𝘁 𝗼𝗻 𝘆𝗼𝘂𝗿 𝗱𝗮𝘁𝗮-𝗽𝗿𝗶𝘃𝗮𝗰𝘆 𝗺𝗶𝗻𝗱 𝗿𝗶𝗴𝗵𝘁 𝗻𝗼𝘄?
    (Pick one — feel free to elaborate in the replies.)

    #AIPrivacy #LLM #DataPrivacy #MLOps #Security #PrivacyTech #AIBuilders #AIInfra #CyberSecurity #Fediverse #Mastodon

  38. 🗳️ Poll for AI builders:

    𝗪𝗵𝗮𝘁’𝘀 𝘄𝗲𝗶𝗴𝗵𝗶𝗻𝗴 𝗵𝗲𝗮𝘃𝗶𝗲𝘀𝘁 𝗼𝗻 𝘆𝗼𝘂𝗿 𝗱𝗮𝘁𝗮-𝗽𝗿𝗶𝘃𝗮𝗰𝘆 𝗺𝗶𝗻𝗱 𝗿𝗶𝗴𝗵𝘁 𝗻𝗼𝘄?
    (Pick one — feel free to elaborate in the replies.)

    #AIPrivacy #LLM #DataPrivacy #MLOps #Security #PrivacyTech #AIBuilders #AIInfra #CyberSecurity #Fediverse #Mastodon