home.social

#voicebots — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #voicebots, aggregated by home.social.

  1. 💸 #OpenAI is going after one of the major pain points of its audio-native models: price. The newest audio model, `gpt-4o-realtime-preview-2024-12-17`, will cost 60% less than its predecessor. #gpt-4o-mini also becomes available through Realtime API, at 10x cheaper cost per token than the old gpt-4o-realtime (10$/M token input, 20$/M tokens output) [3].

    [3] openai.com/api/pricing/

    #GenAI #VoiceBots #Chatbots #AI #LLMs #Agents #RealtimeAPI

  2. 🚦 #LiveKit released a transformers-based, semantic End-of-Turn detector, #opensource on #HuggingFace[1]! This model complements voice activity detectors (#VAD) by predicting whether the user's sentence is complete. This helps reduce false starts up to 85% according to their own testing, and is text-based, with a very low latency (~50ms). Find all the details in their post [2].

    [1] huggingface.co/livekit/turn-de

    [2] blog.livekit.io/using-a-transf

    #GenAI #VoiceBots #Chatbots #AI #LLMs #Agents #RealtimeAPI

  3. 🏃‍♀️ The competition between text-based voice #bots and audio-native models is just getting tougher! Today, both #OpenAI and #LiveKit released new features, just in time for some holiday experiments 🎁

    A thread 👇

    #GenAI #VoiceBots #Chatbots #AI #LLMs #Agents #RealtimeAPI