#vosk — Public Fediverse posts on home.social

DerBrumme @[email protected] · 2026-05-25 · 08:04 UTC

Schwerpunkt 1
Lokales Speech to Text in Linux Mint einrichten: Auf Knopfdruck (beliebiger Shortkey) das Diktat starten & beenden, in jeder beliebigen Anwendung. Keine Cloud, kein mithörender Datenkrake, nur 4-6 GB im RAM.

Schwerpunkt 2
Wie mir KI (GPT) geholfen hat, das alles hinzubekommen, inkl. kompletter Projektdokumentation.

https://blog.derbrumme.de/lokales-speech-to-text-in-linux-mint-einrichten/

#Linux #OpenSource #SpeechToText #STT #Vosk #Privacy #SelfHosting #KI #AI #NerdDictation

#linux #opensource #speechtotext #stt #vosk #privacy

DerBrumme @[email protected] · 2026-05-25 · 08:04 UTC

Schwerpunkt 1
Lokales Speech to Text in Linux Mint einrichten: Auf Knopfdruck (beliebiger Shortkey) das Diktat starten & beenden, in jeder beliebigen Anwendung. Keine Cloud, kein mithörender Datenkrake, nur 4-6 GB im RAM.

Schwerpunkt 2
Wie mir KI (GPT) geholfen hat, das alles hinzubekommen, inkl. kompletter Projektdokumentation.

https://blog.derbrumme.de/lokales-speech-to-text-in-linux-mint-einrichten/

#Linux #OpenSource #SpeechToText #STT #Vosk #Privacy #SelfHosting #KI #AI #NerdDictation

#linux #opensource #speechtotext #stt #vosk #privacy

Xavi @[email protected] · 2026-05-10 · 20:29 UTC

I've been working in #Pitxu these days.

- I left behind the Infinite-loop approach to a Callback-based one triggered by VAD, reducing about a 40% of the load and imilar battery life improvement.

- Added a Long Term Memory system, so that I can reduce the amount of context in use per session relaying on an external support, making it faster and cheaper.

- I've switched #Vosk for #Whisper in the Speech-To-Text step, that brings an incredible improvement on the transcription, which at its turn improves the overall user experience.

- I switched from Gemini 2.5 Flash to Gemini 3.1 Flash-Lite, which improves quality but also penalizes reaction speed.

- I delegated some background work into a new Support process that takes it out from the main (user experience) thread.

- I corrected numerous visualization bugs that improves the user-Pitxu interaction.

All in all, I just had a long conversation with Pitxu, and has been by far the best demo I ever had in year and a half.

This is a self-tap-on-the-shoulder post, thank you for your attention 🙂

#pitxu #vosk #whisper

Habr @[email protected] · 2026-05-10 · 08:22 UTC

Веселимся со Spring: pet-проект по распознаванию речи

Не писал на Spring уже лет 8 и решил по фану написать мини пет проект с api и распознаванием речи. Звучит круто, лет 8-10 назад это заняло бы … вечность, тогда и llm, достаточно качественно распознающих русскую речь, да еще на скромном домашнем пк не было. В общем решил в выходной повеселиться. Погнали веселиться

https://habr.com/ru/articles/1033338/

#Java #Spring_Framework #Vosk #speech_recognition #распознавание_речи #REST_API #WAV #Java_Sound_API #pet_project #веселье

#веселье #pet_project #java_sound_api #wav #rest_api #распознавание_речи

Habr @[email protected] · 2026-04-10 · 14:42 UTC

Как устроена транскрипция в Jitsi Meet: Jigasi, SIP и путь до EMR

Когда мы проектировали пайплайн автоматического заполнения EMR по итогам видеоконсультаций, исходная гипотеза была простой: Jitsi Meet — open source, документация есть, значит, подключить бота и получить транскрипт — задача на пару дней. На практике именно этот слой занял непропорционально много времени относительно своей "очевидности". В этой статье разберу, как устроена транскрипция в Jitsi Meet под капотом, почему это не "просто включить кнопку", с какими конфигурационными нюансами пришлось столкнуться и как в итоге был выстроен пайплайн от видеозвонка до структурированного текста.

https://habr.com/ru/articles/1021992/

#jitsi_meet #jigasi #sip #xmpp #vosk #transcription #emr #fhir #llm #spring_boot

#spring_boot #llm #fhir #emr #transcription #vosk

donthatedontkill @[email protected] · 2026-03-14 · 11:44 UTC

I'm trying to set up voice control for Home Assistant.... in Esperanto! There's only, as far as I know, one local option for an Esperanto STT model able to run on a Raspberry Pi: vosk. And let me tell you, the set up (especially with dockerized home assistant) is, uh, a labor of love, let's say.
Mi sukcesos !
#homeAssistant #esperanto #vosk #stt #docker #languages

#homeassistant #esperanto #vosk #stt #docker #languages

donthatedontkill @[email protected] · 2026-03-14 · 11:44 UTC

I'm trying to set up voice control for Home Assistant.... in Esperanto! There's only, as far as I know, one local option for an Esperanto STT model able to run on a Raspberry Pi: vosk. And let me tell you, the set up (especially with dockerized home assistant) is, uh, a labor of love, let's say.
Mi sukcesos !
#homeAssistant #esperanto #vosk #stt #docker #languages

#homeassistant #esperanto #vosk #stt #docker #languages

María Arias de Reyna @[email protected] · 2026-03-13 · 13:53 UTC

Trying the speech to text engine (vosk) in Kdenlive to add subtitles to some videos I'm working on..

It is mostly right, but sometimes...

#vosk #kdenlive

María Arias de Reyna @[email protected] · 2026-03-13 · 13:53 UTC

Trying the speech to text engine (vosk) in Kdenlive to add subtitles to some videos I'm working on..

It is mostly right, but sometimes...

#vosk #kdenlive

Habr @[email protected] · 2026-02-21 · 16:22 UTC

Как я снизил WER с 33% до 3.3% для русской речи на CPU: сравнение GigaAM, Whisper и Vosk

За два месяца я перепробовал три ASR-движка, шесть моделей Whisper, адаптивное чанкование, T5-коррекцию и ансамблевое голосование — и большая часть идей оказалась тупиком. В статье — подробный разбор шести тупиков и одной находки: почему GigaAM от Сбера на обычном CPU показывает 3.3% WER на русском, обходя Whisper large-v3-turbo на RTX 4090 (7.9%) в 2.4 раза. С бенчмарками, кодом и честными оговорками.

https://habr.com/ru/articles/1002260/

#speechtotext #gigaam #whisper #vosk #onnx #распознавание_речи #WER #голосовой_ввод #ASR #python

#python #asr #голосовой_ввод #wer #распознавание_речи #onnx

Xavi @[email protected] · 2026-02-17 · 08:11 UTC

@techsimplified yes, accuracy issues indeed. The current state is good enough for the initial development tests, but the accuracy STT mistakes makes the rest of the pipeline mediocre, no matter how good it is. The input is the key.

#Vosk has been great but I feel I bump to the limits. I am testing #Whisper and should deliver punctuation and better accuracy, that translates to better interaction with the Chatbot, which brings improved user experience.

I will take a look at your suggestion, but I do focus on Voice to Text rather than Voice to action, as I aim for conversational experience more than simply executing tasks.

Thanx!

#vosk #whisper

Xavi @[email protected] · 2026-02-16 · 17:22 UTC

Marededéusinyó, 4 dies per fer que els ventiladors de la caixa del #Pitxu funcionin a diferents velocitats segons la temperatura, i en silenci.

He après molt aquesta dies. A nivell més de vida, aquest típic soroll d'aparell elèctric (el típic que ens fa canviar-lo per vell) és degut a que emet una freqüència audible, moltes vegades per error, com era el meu cas

Primer he conectat els ventiladors. Funcionen al 100%.
Després el pin de control. Tirar de llibreria GPIO per encendre'ls i apagar-los a certa temperatura.
Després aprendre d'Hysteresis, que és això de que ventili fins mes abaix del llindar per què no s'estigui encenent i apagant cada 5 segons.
Després convertir-lo a PWM, que permet variar la velocitat per que faci menys soroll.
Descobrir com funciona, i que a freqüències baixes el "zumbit" toca els ous. Massa.
Aprendre que s'ha d'usar una freqüència no-audible (~25kHz), i que la llibreria que uso explota a més de 10kHz, i el soroll no mola.
Resulta que totes les llibreries Python fan PWM per software, cal fer-ho per hardware.
La mare que va parir el Kernel de Linux, els overlays, i sa puta mare.

M'he fet un overlay jo mateix, ja tinc els canals que necessito, i ja puc moure els ventiladors a la freqüència que vull.

El #Pitxu ja respira en silenci, i prèn grans bocanades d'aire quan ho necessita.

Entre la millora del micro, el que estic cohent per canviar de #Vosk a #Whisper, i que el hardware aguanti com toca tota la infra, ja començo a tenir ganes de posar-me amb els models altra cop.

#pitxu #vosk #whisper

Xavi @[email protected] · 2026-02-16 · 14:55 UTC

@techsimplified it is, completely! I find that having my hands free to do actions (and queries) is indeed a game changer. I'm just bumping my head to make the STT to work smooth.

This project in the pic is a satellite device from my main #Pitxu ongoing built, chaining STT > Chatbot > TTS. As a satellite, it just captures sound, sends it to the "server" and plays the answer. It is a #RaspberryPiZero2 so it can't really hold all the engines needed.

As per tooling, the whole pack uses:
- #Vosk (now tinkering with #Whisper)
- #Gemini (now tinkering with #Ollama offline)
- #Piper

But a big chunk of my brain goes to the UX hardware:
- screen for a more human interaction
- soundcard I/O (gosh RPi is not yet polished here)
- GPIO buttons, UPS, PWM fan cases,...

#pitxu #raspberrypizero2 #vosk #whisper #gemini #ollama

cyclical_obsessive @cyclical_obsessive · 2025-11-24 · 18:48 UTC

@linuxiac

> Removing PulseAudio..continuing the shift to PipeWire

My #GoPiGo3 robot just shuddered in fear of becoming deaf and mute.

#espeak-ng #Vosk #SpeechRecognition #TTS #LinuxAudio

#gopigo3 #espeak #vosk #speechrecognition #tts #linuxaudio

Habr @[email protected] · 2025-11-23 · 14:42 UTC

Голосовой ввод для Windows через Vosk своими руками

Я пытался найти в Windows похожий встроенный инструмент или готовое решение, но все они либо брали на себя слишком много неактуального для меня функционала, так как задумывались для людей с ограниченными возможностями, либо были платными, либо были недоступны для русского языка. Лучшим выходом из моей ситуации было создать свое минималистичное решение, и вот как это было:

https://habr.com/ru/articles/969360/

#vosk #распознавание_речи #speechtotext #python #голосовые_интерфейсы #winapi

Habr @[email protected] · 2025-11-13 · 07:12 UTC

Без интернета и шпионов: как мы собрали локального голосового ассистента

Облачные ассистенты вроде Алисы , Google Assistant и Siri давно стали привычными. Но у всех у них одни и те же слабые места: зависимость от быстрого интернета и риск утечки данных. И речь не только о персональной информации — дома нередко обсуждают темы, которые можно отнести к коммерческой или даже военной тайне. Неудивительно, что многим некомфортно говорить в присутствии микрофона, который каждое слово отправляет куда-то «в облако» (один из наших заказчиков прямо сказал: «никаких Алис в доме не будет») . На Хабре уже появлялись статьи про попытки заменить Алису на полностью локальные решения. Но почти всегда все сводилось к стандартной схеме: ESP32-микрофон → Home Assistant → intent recognition . Такая связка работает, но до действительно «умного» ассистента ей далеко. Мы пошли дальше и собрали свой голосовой ассистент, о котором расскажем в статье.

https://habr.com/ru/companies/wirenboard/articles/965856/

#Wiren_Board #BARY #Алиса #голосовой_ассистент #распознавание_речи #vosk #Piper #Embedding #Wake_Word #умный_дом

#умный_дом #wake_word #embedding #piper #vosk #распознавание_речи

Habr @[email protected] · 2025-08-04 · 19:02 UTC

Scribe: Управляем ПК голосом. Бесплатно, оффлайн и с открытым кодом

Всем привет! Многие знают, что в Windows есть встроенная функция «Распознавание речи», а в новых версиях — «Голосовой ввод» (Win + H). Это неплохие инструменты, но меня в них всегда смущали несколько моментов: непрозрачность в вопросах приватности, ограниченная кастомизация и глубокая интеграция в систему, которую не всегда удобно настраивать. Хотелось чего-то простого, гарантированно оффлайнового и с открытым исходным кодом, чтобы точно знать, как оно работает. Так родилась идея создать Scribe — полностью автономного и максимально гибкого голосового ассистента. В основе — приватность, автономность и гибкость. Я постарался реализовать функции, которых мне не хватало в других программах.

https://habr.com/ru/articles/933968/

#распознавание_речи #голосовое_управление #vosk #pyqt5 #windows #open_source

#open_source #windows #pyqt5 #vosk #голосовое_управление #распознавание_речи

mʕ•ﻌ•ʔm bitPickup @[email protected] · 2025-07-23 · 06:58 UTC

@[email protected]
> to evolve AI tools, for voice over for example

Actually I just went back to use #ubuntuStudio 2022.04lts instead of 2024.04lts.
Looks like #VOSK and #deepSeek will be next:
https://www.youtube.com/watch?v=T7sR-4DFhpQ

https://github.com/ideasman42/nerd-dictation

https://citizix.com/how-to-setup-deepseek-locally-on-ubuntu-22.04-server-with-ollama/
https://linuxblog.io/install-deepseek-linux/

@mina @resl

#ubuntustudio #vosk #deepseek

Habr @[email protected] · 2025-05-15 · 20:32 UTC

Добавление слов в языковую модель Vosk

Краткий гайд как дополнить vosk модель распознавания речи своими словами. Для дальнейшего использования в своих проектах. Все подводные камни в использовании инструмента kaldi в 2025 году Принять испытание

https://habr.com/ru/articles/909788/

#vosk #kaldi #адаптация_модели_vosk #распознавание_речи

#распознавание_речи #адаптация_модели_vosk #kaldi #vosk

athmane mokraoui [BoF] ⏚ꝃñ⌁⁂ @[email protected] · 2025-03-21 · 19:43 UTC

Un anaouder mouezh emgefre, graet gant ar meziantoù Anaouder -version 1.0.0, Kaldi ha Vosk

Fait avec les logiciels open source Anaouder, Kaldi et Vosk.

Istitlañ un video - Sous-titrer une vidéo en breton.

Lien : https://abp.bzh/anaouder/istitlan.php

#Breton #BZH #VOSK #Kaldi #Bretagne

#bretagne #kaldi #vosk #bzh #breton

athmane mokraoui [BoF] ⏚ꝃñ⌁⁂ @[email protected] · 2025-03-21 · 19:43 UTC

Un anaouder mouezh emgefre, graet gant ar meziantoù Anaouder -version 1.0.0, Kaldi ha Vosk

Fait avec les logiciels open source Anaouder, Kaldi et Vosk.

Istitlañ un video - Sous-titrer une vidéo en breton.

Lien : https://abp.bzh/anaouder/istitlan.php

#Breton #BZH #VOSK #Kaldi #Bretagne

#bretagne #kaldi #vosk #bzh #breton

athmane mokraoui [BoF] ⏚ꝃñ⌁⁂ @[email protected] · 2025-03-21 · 19:40 UTC

#Anaouder

Reconnaissance vocale pour le breton avec Vosk.

Lien : https://pypi.org/project/anaouder/

#Breton #BZH #Vosk #Kaldi

#kaldi #vosk #bzh #breton #anaouder

athmane mokraoui [BoF] ⏚ꝃñ⌁⁂ @[email protected] · 2025-03-21 · 19:40 UTC

#Anaouder

Reconnaissance vocale pour le breton avec Vosk.

Lien : https://pypi.org/project/anaouder/

#Breton #BZH #Vosk #Kaldi

#kaldi #vosk #bzh #breton #anaouder

athmane mokraoui [BoF] ⏚ꝃñ⌁⁂ @[email protected] · 2025-03-18 · 14:19 UTC

ibus-speech-to-text will provide voice dictation capabilities to any application supporting IBus input methods in #Fedora Linux 42, using VOSK for local voice recognition.

🔗 https://fedoraproject.org/wiki/Changes/ibus-speech-to-text

#ibus #STT #SpeechToText #VOSK

#vosk #speechtotext #stt #ibus #fedora

athmane mokraoui [BoF] ⏚ꝃñ⌁⁂ @[email protected] · 2025-03-18 · 14:19 UTC

ibus-speech-to-text will provide voice dictation capabilities to any application supporting IBus input methods in #Fedora Linux 42, using VOSK for local voice recognition.

🔗 https://fedoraproject.org/wiki/Changes/ibus-speech-to-text

#ibus #STT #SpeechToText #VOSK

#vosk #speechtotext #stt #ibus #fedora

Habr @[email protected] · 2025-02-03 · 11:32 UTC

Свой Google в локалке. Ищем иголку в стоге сена

В статье мы разработаем свой собственный Google, который можно будет запустить в любой локальной сети как атакующим, что ищут пароли, так и защитникам, которым небезразлична безопасность их родной локалки. И что примечательно, наш Google будет состоять на 99% из готовых компонентов, практически без дополнительного программирования. А внедрение такой системы потребует ввода всего пары команд.

https://habr.com/ru/companies/ussc/articles/878340/

#active_directory #google #smb #tesseract #vosk #csv #gnu #ftp #краулинг #dcap

#dcap #краулинг #ftp #gnu #csv #vosk

Tykayn @[email protected] · 2025-01-15 · 22:35 UTC

qui qui veut de la transcription de vidéo ou de fichier audio faite avec du logiciel libre ?
Scribe - Ceméa

https://scribe.cemea.org

vous pouvez aussi l'auto héberger, et ça fonctionne sans enrichir de milliardaire facho.

#vosk #scribe #cemea #logicielLibre

#vosk #scribe #cemea #logiciellibre

Tykayn @[email protected] · 2025-01-15 · 22:35 UTC

qui qui veut de la transcription de vidéo ou de fichier audio faite avec du logiciel libre ?
Scribe - Ceméa

https://scribe.cemea.org

vous pouvez aussi l'auto héberger, et ça fonctionne sans enrichir de milliardaire facho.

#vosk #scribe #cemea #logicielLibre

#vosk #scribe #cemea #logiciellibre

Angelo Veltens 🏳️‍🌈 @[email protected] · 2024-12-05 · 20:45 UTC

I am really impressed by both the speed and accuracy of #vosk speech-to-text on a Raspberry Pi 5. This is really usable. #Whisper was either far too inaccurate (at least for german) or unusable slow with larger, more accurate models.

Did you try any of these? What are your experiences?

#HomeAssistant

#vosk #whisper #homeassistant

Angelo Veltens 🏳️‍🌈 @[email protected] · 2024-12-05 · 20:45 UTC

I am really impressed by both the speed and accuracy of #vosk speech-to-text on a Raspberry Pi 5. This is really usable. #Whisper was either far too inaccurate (at least for german) or unusable slow with larger, more accurate models.

Did you try any of these? What are your experiences?

#HomeAssistant

#vosk #whisper #homeassistant

Vincent-Xavier ⏚ @[email protected] · 2024-12-02 · 09:49 UTC

Les accidents de la vie c'est l'occasion d'utiliser de nouvelles choses. Avoir un bras dans le plâtre et pouvoir moins écrire a été l'occasion pour moi d'essayer la synthèse audio. Avec le logiciel libre vosk installé sur mon téléphone je peux désormais dicter mes textes au lieu de les écrire.
J'ai ainsi pu dicter mes appréciation plutôt que de les écrire et je n'ai plus qu'à corriger la syntaxe parfois défaillante.
#voskapi #voskspeech #vosk

#voskapi #voskspeech #vosk

Vincent-Xavier ⏚ @[email protected] · 2024-12-02 · 09:49 UTC

Les accidents de la vie c'est l'occasion d'utiliser de nouvelles choses. Avoir un bras dans le plâtre et pouvoir moins écrire a été l'occasion pour moi d'essayer la synthèse audio. Avec le logiciel libre vosk installé sur mon téléphone je peux désormais dicter mes textes au lieu de les écrire.
J'ai ainsi pu dicter mes appréciation plutôt que de les écrire et je n'ai plus qu'à corriger la syntaxe parfois défaillante.
#voskapi #voskspeech #vosk

#voskapi #voskspeech #vosk

R. L. Dane :debian: :openbsd: @RL_Dane · 2024-11-04 · 02:12 UTC

@nigel @[email protected]

I haven't tried swipe typing a whole lot with FUTO. The best thing it has going for it is the voice dictation, which is far better than any of the #Vosk-based FOSS options that are out there now, like #SayBoard.

I agree that #Heliboard is simply the best FOSS keyboard out there, hands down.

I find myself using gboard (firewalled off from the internet, of course) recently, as I can just fire away at full speed with minimal corrections afterward (two-thumb typing).

... 1/2

#heliboard #vosk #sayboard

R. L. Dane :debian: :openbsd: @[email protected] · 2024-11-04 · 02:12 UTC

@nigel @plym

I haven't tried swipe typing a whole lot with FUTO. The best thing it has going for it is the voice dictation, which is far better than any of the #Vosk-based FOSS options that are out there now, like #SayBoard.

I agree that #Heliboard is simply the best FOSS keyboard out there, hands down.

I find myself using gboard (firewalled off from the internet, of course) recently, as I can just fire away at full speed with minimal corrections afterward (two-thumb typing).

... 1/2

#vosk #sayboard #heliboard

Tykayn @[email protected] · 2024-10-30 · 11:53 UTC

quelqu'un a déjà fait un truc avec #vosk qui permet de distinguer les locuteurs dans la #transcription ?

#vosk #transcription

Tykayn @[email protected] · 2024-10-30 · 11:53 UTC

quelqu'un a déjà fait un truc avec #vosk qui permet de distinguer les locuteurs dans la #transcription ?

#vosk #transcription

Andresimous @[email protected] · 2024-09-25 · 17:16 UTC

Wie versprochen schiebe ich mal ein kleines #Tutorial zu #Vosk rein. Mit Vosk könnt Ihr #Untertitel zu Videos erzeugen & Audio-Dateien transkribieren. Vosk ist also ein #SpeechToText Programm..
Auf der offiziellen Vosk-Webseite steht als Installationsanleitung:
- Installiere die Pakete Python3, pip3 und ffmpeg
- Installiere Vosk mit dem Befehl: pip3 install vosk

Doch das funktionierte bei mir auf Linux Mint nicht, denn nach dem pip3-Befehl konnte Vosk nicht gestartet werden.
1/x

#tutorial #vosk #untertitel #speechtotext

Andresimous @[email protected] · 2024-09-25 · 17:16 UTC

Wie versprochen schiebe ich mal ein kleines #Tutorial zu #Vosk rein. Mit Vosk könnt Ihr #Untertitel zu Videos erzeugen & Audio-Dateien transkribieren. Vosk ist also ein #SpeechToText Programm..
Auf der offiziellen Vosk-Webseite steht als Installationsanleitung:
- Installiere die Pakete Python3, pip3 und ffmpeg
- Installiere Vosk mit dem Befehl: pip3 install vosk

Doch das funktionierte bei mir auf Linux Mint nicht, denn nach dem pip3-Befehl konnte Vosk nicht gestartet werden.
1/x

#tutorial #vosk #untertitel #speechtotext

Andresimous @[email protected] · 2024-09-24 · 19:21 UTC

Ich bin gerade dabei, Untertitel für meine Reisevideos erzeugen zu lassen. Leider versteht #Vosk meinen Dialekt nicht so ganz.... 😄
Es wird doch ein ganz harmloses und jugendfreies Reisefilmchen....

#vosk

barefootstache @[email protected] · 2024-08-07 · 08:18 UTC

#DailyBloggingChallenge (362/365)

Originally wanted to use #VOSK to transcribe the #SpeechToText. Initially tried it out over #KdenLive and its ‘Speech Recognition’ tool.

This took quite awhile to setup, since it is not concrete what kind file format, if any, the VOSK model should have. Additionally, the recommendation of setting up a virtual #Python environment didn’t work as expect and went with the global approach.

And finally scratched the whole approach, once realizing that transcribing 26 min audio clip is taking longer than 10min.

#dailybloggingchallenge #speechtotext #kdenlive #vosk #python

Найменшенький @[email protected] · 2024-07-19 · 17:45 UTC

Dicio assistant - багатомовний голосовий асистент під Andoid з відкритим кодом.

Наданий момент українська мова відсутня, але робота над цією проблемою у процесі. І я хочу попросити вас про допомогу в локалізації. Інтерфейс додатка на WebLate вже перекладений, а от внутрішнє розпізнавання команд поки не повністю. Щоб розпізнавання було кращим і точнішим я прошу в вас допомоги. Я створив відгалуження репозиторію і вже переклав деякі навички, інші ще в процесі. Ви можете переглянути вже присутній переклад і запропонувати виправлення чи додати щось нове до нього, або створивши ще одне відгалуження, або просто написати мені тут і я додам ці зміни.

#foss #fdroid #android #assistant #voiceassistant #vosk #голосовий_асистент #асистент #локалізація #переклад #українізація

#android #assistant #fdroid #foss #voiceassistant #vosk

Tykayn @[email protected] · 2024-05-30 · 15:26 UTC

Speech to Text — #Kdenlive Manual 24.05 documentation
#stt #vosk #transcription

https://docs.kdenlive.org/en/effects_and_compositions/speech_to_text.html

#kdenlive #stt #vosk #transcription

Найменшенький @[email protected] · 2024-03-18 · 15:36 UTC

Sayboard - Голосовий IME (клавіатура) з відкритим вихідним кодом для Android

Розпізнавання відбувається повністю локально за допомоги бібліотеки Vosk.

Завантажити мовні моделі можна тут.

#foss #android #fdroid #keyboard #vosk #voice #stt #клавіатура #голос_у_текст #text_to_speach

#android #fdroid #foss #keyboard #stt #text_to_speach

Minkiu @[email protected] · 2023-11-02 · 00:34 UTC

I just stumbled upon Sayboard on F-Droid, an on-device voice IME for #android that uses the #Vosk library, where you download the models and it's privacy friendly
https://github.com/ElishaAz/Sayboard

#android #vosk