#commonvoice — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #commonvoice, aggregated by home.social.
-
#Mozilla #CommonVoice 25.0 released.
Common Voice Corpus, the world's largest free dataset of human voices, maintained by Mozilla, has been released as v25.0. We have already reported several times on this project, which is published under the #Creative_Commons #CCO license. The Common Voice project, which has existed since 2017, thereby promotes the speech recognition market as an alternative to the large commercial providers such as Amazon, Apple, Google and Microsoft...
https://linuxnews.de/mozilla-common-voice-25-0-veroeffentlicht/
-
Do you also want to contribute to the Open Source movement?
Don't know where to start?
- write a new program in
- C++
- rust
- Python
- goLang
- hunt for bugs and submit patches in existing programs
- choose any OS you like to code for: FreeBSD, OpenBSD, NetBSD, Linux (win64 is fine if you work mostly there)
You can't write programs (yet)?
- We need translators: many programs, including projects you use yourself, don't have enough translators for their supported languages
- Interfaces to contribute are easy to follow
Want one easy example?
You must (have) use(d) Google Voice
It's getting enshittified (makes more and more mistakes) and is NOT open source
Common Voice by Mozilla
- Common Voice is Open Source
- I contribute there (years)
- You only need a phone
- I contribute in three languages (ES NL UK EN)
Don't just use Open Source software CONTRIBUTE!
Make the movement more powerful!
My stats are here (screencap)
What are yours?
https://commonvoice.mozilla.org/en/dashboard/stats
#OpenSource #Voice #Common #Voice #CommonVoice #programming #rust #C #CLang #goLang #go #Linux #BSD #freeBSD #openBSD #netBSD #mac #win64
-
Mozilla's Common Voice project aims to teach machines how people speak. It now has a new "Respondi Demandojn" ("Answer Questions") section, where you freely answer questions instead of only reading sentences aloud. Donate your voice now to grow the Esperanto dataset!
-
Our Mozillian @hellosct1 helps you get started with translation at @capitoledulibre for #pontoon #sumo #mdn #commonvoice #cdl2025
-
Today, our Mozillian @hellosct1 is taking part in a translation workshop at @capitoledulibre on #sumo #pontoon #mdn #commonvoice. A chance for you to get started in this area. #cdl2025 https://capitoledulibre.org/programme/
-
It's been another big year as I work towards completing my #dissertation on voice dataset documentation and how it influences how well #speech technologies work for all voices at the #ANU School of Cybernetics - with big thanks to my supervisors, Elizabeth Williams, Alexandra Zafiroglu, Jofish Kaye and Paul Wong 黃仲熙.
I've wrapped up a partnership with Mozilla's #CommonVoice team, which let me explore the #dataset in a lot more detail - big thanks to EM Lewis-Jong, @jessie and Dmitrij Feller in particular.
It was an incredible honor to keynote #FF24 at the National Film and Sound Archive of Australia alongside Peter-Lucas Jones of Te Hiku Media, expertly facilitated by Keir Winesmith - thanks @ingridbmason and team for the opportunity - and stay tuned for a little project we are working on - we know you're all eager for the video of this keynote, but we're adding a little more magic.
I helped out with @everythingopen Media and Comms this year, and am looking forward to speaking in January in Adelaide.
A huge thanks to my fellow #PhD buddies - Lorenn Ruster, @nedcpr, Glen Berman, Tom Chan, Danny Bettay, Charlotte Bradley, @Amirasadi, Memunat Ajoke Ibrahim and the later cohorts for all your support, shut up and write sessions and intellectual growth.
-
On this last day of @parinux's #PSLXXL show, we are presenting #Pontoon #traductions #Nightly #CommonVoice #PDF in Firefox
-
For the past couple of years, as each new @mozilla #CommonVoice dataset of #voice #data is released, I've been using @observablehq to visualise the #metadata coverage across the 100+ languages in the dataset.
Version 17 was released yesterday (big ups to the team - EM Lewis-Jong, @jessie, Gina Moape, Dmitrij Feller) and there are some super interesting insights from the visualisation:
➡ Catalan (ca) now has more data in Common Voice than English (en) (!)
➡ The language with the highest average audio utterance duration at nearly 7 seconds is Icelandic (is). Perhaps Icelandic words are longer? I suspect so!
➡ Spanish (es), Bangla (Bengali) (bn), Mandarin Chinese (zh-CN) and Japanese (ja) all have a lot of recorded utterances that have not yet been validated. Albanian (sq) has the highest percentage of validated utterances, followed closely by Erzya / Arisa (myv).
➡ Votic (vot) has the highest percentage of invalidated utterances, but with 76% of utterances invalidated, I wonder if this language has been the target of deliberate invalidation activity (invalidating valid sentences, or recording sentences to be deliberately invalid) given the geopolitical instability in Russia currently.
See the visualisation here and let me know your thoughts below!
➡ https://observablehq.com/@kathyreid/mozilla-common-voice-v17-dataset-metadata-coverage
#linguistics #languages #data #VoiceAI #VoiceData #SpeechAI #SpeechData #DataViz
-
Last week, as part of my #PhD program at the #ANU School of #cybernetics, I gave my final presentation, which is a summary of my methods and #research findings. I covered my interview work, the #dataset documentation analysis work I've been doing and my analysis work around #accents in @mozilla's #CommonVoice platform.
There were some insightful and thought-provoking questions from my panel and audience members, and of course - so many ideas for future research inquiry!
A huge thanks to my panel, chaired so well by Professor Alexandra Zafiroglu, to Dr Elizabeth Williams, my meticulous, methodical and always-encouraging Primary Supervisor, and to my co-supervisors Dr Jofish Kaye and Dr Paul Wong 黃仲熙 for their deep expertise in #HCI and #data respectively.
Similarly, a huge thank you to my #PhD cohort - Charlotte Bradley, Tom Chan, Danny Bettay and Sam Backwell - as well as the other cohorts in the School - for your encouragement and intellectual journeying.
#PhD #PhDlife #cybernetics #milestone #ANU #voiceAI #speechAI #ASR #SpeechRecognition
-
#Signal will add the option to convert voice messages to text on the device itself. How? By using #STT models created by #Coqui based on #CommonVoice data:
https://www.a2p.it/tech-stuff/coquistt-signal-love-death-to-voice-messages/
I don't know whether Signal will take Basque into account, but Coqui has made a free (and the only?) STT model for Basque available to anyone who wants to use it:
https://coqui.ai/models
#HizkuntzaTeknologiak #SoftwareLibrea #HizketarenEzagutza #Mozilla
-
Thanks to the Basque #CommonVoice recordings, the #Coqui project has created a free STT model for Basque:
https://librezale.eus/pipermail/librezale/2021-April/013914.html
If you know how to use what they have created, go ahead and share the results!
Help out with recordings too! https://commonvoice.mozilla.org/eu