home.social

#speechrecognition — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #speechrecognition, aggregated by home.social.

  1. #UnplugBigTech Tipp 5: Open-Source-Sprachassistent

    Verabschiede dich von Alexa und anderen Sprachassistenten, die deine Gespräche mithören und auswerten. Nutze stattdessen eine datenschutzfreundliche Alternative wie OpenVoiceOS, ein Open-Source-Sprachassistent, der von einer aktiven Community weiterentwickelt wird und auf einem RaspberryPi läuft. So behältst du die Kontrolle über deine Daten.

    openvoiceos.org/

    #Alexa #OpenVoiceOS #Sprachassistent #VoiceControl #SpeechRecognition #datenschutz #privacy

  2. Govorun PC: переносим офлайн-диктовку с Android на Windows за один вечер (с Claude)

    На Android у меня живёт Govorun Lite — офлайн-диктовка на русском. Нажал кнопку, сказал, текст вставился. Никаких облаков, никакой отправки голоса на серверы. Работает через GigaAM v2 от Сбера. Проблема одна: на ПК такого нет. Встроенная Windows-диктовка — онлайн. Whisper — либо медленный, либо требует видеокарту. Сторонние сервисы — снова облако. Я решил портировать Govorun на Windows, и для ускорения взял Claude как пару-программиста. Что из этого вышло — в этой статье.

    habr.com/ru/articles/1031240/

    #python #speechrecognition #onnx #windows #llm #голосовой_ввод

  3. Amical - Open-source AI dictation app

    Cossmology Profile: dub.sh/Vk7tPkn

    Key People: Haritabh Singh, Naomi Chopra

    #SpeechRecognition #OpenSource #OSS #COSS

  4. Non-lexical sounds impact ASR in clinical documentation.

    🔊 NLCS: 2.4% of total words, conveying key clinical info
    😷 Google's WER: 40.8%, Amazon's: 57.2% (all NLCS)
    ❌ Error rates for clinically relevant NLCS: Google 94.7%, Amazon 98.7%
    📝 Total words: 135,647; 3284 NLCS; 76 conveyed critical data
    🗣️ Described implications on documentation accuracy

    #ASR #ClinicalDocumentation #SpeechRecognition #AI #NLPSolutions #Pub2Post tnyp.me/Npmiz0F4/m

  5. @linuxiac

    > Removing PulseAudio..continuing the shift to PipeWire

    My robot just shuddered in fear of becoming deaf and mute.

    -ng

  6. 🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬

    Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬

    follow hem here: @thorstenvoice
    or on YouTube: youtube.com/@ThorstenMueller YouTube channel!

    #Accessibility #FLOSS #TTS #ParlerTTS #OpenSource #VoiceTech #TextToSpeech #AI #CoquiAI #VoiceAssistant #Sprachassistent #MachineLearning #AccessibilityMatters #FLOSS #TTS #OpenSource #Inclusivity #FOSS #Coqui #AI #CoquiAI #VoiceAssistant #Sprachassistent #VoiceTechnology #KünstlicheStimme #MachineLearning #Python #Rhasspy #TextToSpeech #VoiceTech #STT #SpeechSynthesis #SpeechRecognition #Sprachsynthese #ArtificialVoice #VoiceCloning #Spracherkennung #CoquiTTS #voice #a11y #ScreenReader

  7. I'm exploring ways to improve audio preprocessing for speech recognition for my [midi2hamlib](github.com/DO9RE/midi2hamlib) project. Do any of my followers have expertise with **SoX** or **speech recognition**? Specifically, I’m seeking advice on: 1️⃣ Best practices for audio preparation for speech recognition. 2️⃣ SoX command-line parameters that can optimize audio during recording or playback.
    github.com/DO9RE/midi2hamlib/b #SoX #SpeechRecognition #OpenSource #AudioProcessing #ShellScripting #Sphinx #PocketSphinx #Audio Retoot appreciated.

  8. Medallia acquires voice-to-text specialist Voci Technologies for $59M - M&A has largely slowed down in the current market, but there remain pockets of activity when the... more: feedproxy.google.com/~r/Techcr #artificialintelligence #customerexperience #speechrecognition #vocitechnologies #fundings&exits #enterprise #sentiment #startups #medallia #exit #m&a #tc