#speechrecognition — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #speechrecognition, aggregated by home.social.
-
#UnplugBigTech Tipp 5: Open-Source-Sprachassistent
Verabschiede dich von Alexa und anderen Sprachassistenten, die deine Gespräche mithören und auswerten. Nutze stattdessen eine datenschutzfreundliche Alternative wie OpenVoiceOS, ein Open-Source-Sprachassistent, der von einer aktiven Community weiterentwickelt wird und auf einem RaspberryPi läuft. So behältst du die Kontrolle über deine Daten.
#Alexa #OpenVoiceOS #Sprachassistent #VoiceControl #SpeechRecognition #datenschutz #privacy
-
Govorun PC: переносим офлайн-диктовку с Android на Windows за один вечер (с Claude)
На Android у меня живёт Govorun Lite — офлайн-диктовка на русском. Нажал кнопку, сказал, текст вставился. Никаких облаков, никакой отправки голоса на серверы. Работает через GigaAM v2 от Сбера. Проблема одна: на ПК такого нет. Встроенная Windows-диктовка — онлайн. Whisper — либо медленный, либо требует видеокарту. Сторонние сервисы — снова облако. Я решил портировать Govorun на Windows, и для ускорения взял Claude как пару-программиста. Что из этого вышло — в этой статье.
https://habr.com/ru/articles/1031240/
#python #speechrecognition #onnx #windows #llm #голосовой_ввод
-
Amical - Open-source AI dictation app
Cossmology Profile: https://dub.sh/Vk7tPkn
Key People: Haritabh Singh, Naomi Chopra
-
Non-lexical sounds impact ASR in clinical documentation.
🔊 NLCS: 2.4% of total words, conveying key clinical info
😷 Google's WER: 40.8%, Amazon's: 57.2% (all NLCS)
❌ Error rates for clinically relevant NLCS: Google 94.7%, Amazon 98.7%
📝 Total words: 135,647; 3284 NLCS; 76 conveyed critical data
🗣️ Described implications on documentation accuracy#ASR #ClinicalDocumentation #SpeechRecognition #AI #NLPSolutions #Pub2Post https://tnyp.me/Npmiz0F4/m
-
Learn the basics of neural networks and backpropagation: https://www.youtube.com/playlist?list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi
#video #tutorial #deepLearning #LLMs #recognition #speechRecognition #visualRecognition #neuralNeworks #machineLearning
-
> Removing PulseAudio..continuing the shift to PipeWire
My #GoPiGo3 robot just shuddered in fear of becoming deaf and mute.
-
(Neural) Networking with a Business Card https://hackaday.com/2025/11/15/neural-networking-with-a-business-card/ #circuitboardbusinesscard #ArtificialIntelligence #speechrecognition #voicerecognition #businesscard #PCBHacks #rp2040
-
Bold ideas that bridge talk and tech spark change and invite your thoughts. Share your voice! #NLP #ConversationalAI #SpeechRecognition #VirtualAssistants #RealTimeTranslation #SentimentAnalysis #LanguageModels #VoiceTech #FutureTech #ArtisticVision #RealismArt
https://medium.com/@sanjay.mohindroo66/the-future-speaks-nlp-conversational-ai-empowering-our-daily-lives-2b3e1ccdfdf1 -
Bold ideas that bridge talk and tech spark change and invite your thoughts. Share your voice! #NLP #ConversationalAI #SpeechRecognition #VirtualAssistants #RealTimeTranslation #SentimentAnalysis #LanguageModels #VoiceTech #FutureTech #ArtisticVision #RealismArt
https://medium.com/@sanjay.mohindroo66/the-future-speaks-nlp-conversational-ai-empowering-our-daily-lives-2b3e1ccdfdf1 -
The Marvel of Auditory and Cognitive Networks Working Together in Your Brain
#AuditoryProcessing #BrainScience #NeuralNetworks #CognitiveScience #Hearing #SpeechRecognition #BrainPlasticity #CentralNervousSystem #SoundProcessing #Neuroscience #ListeningSkills #BrainHealth #AuditoryDisorders #LearningAndMemory
-
🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬
Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬
follow hem here: @thorstenvoice
or on YouTube: https://www.youtube.com/@ThorstenMueller YouTube channel!#Accessibility #FLOSS #TTS #ParlerTTS #OpenSource #VoiceTech #TextToSpeech #AI #CoquiAI #VoiceAssistant #Sprachassistent #MachineLearning #AccessibilityMatters #FLOSS #TTS #OpenSource #Inclusivity #FOSS #Coqui #AI #CoquiAI #VoiceAssistant #Sprachassistent #VoiceTechnology #KünstlicheStimme #MachineLearning #Python #Rhasspy #TextToSpeech #VoiceTech #STT #SpeechSynthesis #SpeechRecognition #Sprachsynthese #ArtificialVoice #VoiceCloning #Spracherkennung #CoquiTTS #voice #a11y #ScreenReader
-
Goode @thorstenvoice, just found your channel and I'm impressed! Your work on TTS is fantastic and so important for accessibility in the FLOSS community. Keep it up! #AccessibilityMatters #FLOSS #TTS #OpenSource #Inclusivity #FOSS #Coqui #AI #CoquiAI #VoiceAssistant #Sprachassistent #VoiceTechnology #KünstlicheStimme #MachineLearning #Python #Rhasspy #TextToSpeech #VoiceTech #STT #SpeechSynthesis #SpeechRecognition #Sprachsynthese #ArtificialVoice #VoiceCloning #Spracherkennung #CoquiTTS #voice #a11y #ScreenReader
-
I'm exploring ways to improve audio preprocessing for speech recognition for my [midi2hamlib](https://github.com/DO9RE/midi2hamlib) project. Do any of my followers have expertise with **SoX** or **speech recognition**? Specifically, I’m seeking advice on: 1️⃣ Best practices for audio preparation for speech recognition. 2️⃣ SoX command-line parameters that can optimize audio during recording or playback.
https://github.com/DO9RE/midi2hamlib/blob/main/tests/speech_menu.sh #SoX #SpeechRecognition #OpenSource #AudioProcessing #ShellScripting #Sphinx #PocketSphinx #Audio Retoot appreciated. -
MLCommons and Hugging Face have launched a multilingual speech dataset with over one million hours of audio #AI #AIResearch #HuggingFace #MLCommons #MachineLearning #NLP #SpeechDataset #SpeechRecognition #VoiceRecognition #VoiceTech
-
Medallia acquires voice-to-text specialist Voci Technologies for $59M - M&A has largely slowed down in the current market, but there remain pockets of activity when the... more: http://feedproxy.google.com/~r/Techcrunch/~3/elRaoZlYTeM/ #artificialintelligence #customerexperience #speechrecognition #vocitechnologies #fundings&exits #enterprise #sentiment #startups #medallia #exit #m&a #tc