#whispercpp — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #whispercpp, aggregated by home.social.
-
Have the practice of bookmarking content for future processing and currently working on a script that uses various services to hijack endpoints via #curl. The content is hosted on #Instagram as reels.
One service downloads the reel while the other transcribes it.
Now that the transcription service has a daily limit, I am wondering which approach I will take to overcome this obstacle.
Either one can #SOCKS5 through curl onto the #Tor network to create a new connection after hitting the daily limit again.
Or one can #whisperCpp over the downloaded reel.
-
Have the practice of bookmarking content for future processing and currently working on a script that uses various services to hijack endpoints via #curl. The content is hosted on #Instagram as reels.
One service downloads the reel while the other transcribes it.
Now that the transcription service has a daily limit, I am wondering which approach I will take to overcome this obstacle.
Either one can #SOCKS5 through curl onto the #Tor network to create a new connection after hitting the daily limit again.
Or one can #whisperCpp over the downloaded reel.
-
Have the practice of bookmarking content for future processing and currently working on a script that uses various services to hijack endpoints via #curl. The content is hosted on #Instagram as reels.
One service downloads the reel while the other transcribes it.
Now that the transcription service has a daily limit, I am wondering which approach I will take to overcome this obstacle.
Either one can #SOCKS5 through curl onto the #Tor network to create a new connection after hitting the daily limit again.
Or one can #whisperCpp over the downloaded reel.
-
Have the practice of bookmarking content for future processing and currently working on a script that uses various services to hijack endpoints via #curl. The content is hosted on #Instagram as reels.
One service downloads the reel while the other transcribes it.
Now that the transcription service has a daily limit, I am wondering which approach I will take to overcome this obstacle.
Either one can #SOCKS5 through curl onto the #Tor network to create a new connection after hitting the daily limit again.
Or one can #whisperCpp over the downloaded reel.
-
Whisper.cpp đã ra bản prototype dùng được: chuyển âm thanh sang văn bản locally (CPU/GPU), căn chỉnh từ‑từng‑từ đa ngôn ngữ, công cụ chỉnh sửa thủ công, giao diện editor mượt mà, xuất subtitle. Hoạt động offline, không phụ thuộc cloud, và dự định giữ miễn phí. Cần ý kiến về tính năng & giấy phép. #WhisperCPP #AI #Transcription #OpenSource #Vietnam #CôngCụ #FreeSoftware #TruyềnÂmThanh #AIđịaphương
https://www.reddit.com/r/LocalLLaMA/comments/1qkjrrc/whispercpp_update_answering_common_questions/
-
Tôi đang phát triển app chuyển giọng nói thành văn bản dùng whisper.cpp + WAV2VEC2 cho đồng bộ thời gian cực chính xác (±10‑20 ms). Ứng dụng chạy locally trên CPU/GPU, xuất SRT, VTT, JSON, hỗ trợ đa ngôn ngữ. Cloud Groq chỉ ổn cho tiếng Anh, đa ngôn ngữ giảm độ chính xác. Bạn thích tốc độ nhanh (tiếng Anh) hay độ chính xác đa ngôn ngữ chậm hơn? Cần ý kiến! #AI #MachineLearning #Transcription #whispercpp #CôngNghệ #NhậnDạngGiọngNói #Vietnam
-
Tìm kiếm triển khai Whisper từ đầu. Người dùng cố gắng triển khai Whisper trên thiết bị cạnh Orangepi AI Pro 20T nhưng thất bại. #Whisper #TriểnKhảiTừĐầu #Orangepi #AscendNPU #AI #MachineLearning #ỨngDụngWhisper #WhisperCPP
https://www.reddit.com/r/LocalLLaMA/comments/1ol66k5/whisper_implementation_from_scratch/
-
Does anybody know of a better #speechToText alternative to this?
This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.
QuantiusBenignus/BlahST
https://github.com/QuantiusBenignus/BlahSTSpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.
-
Does anybody know of a better #speechToText alternative to this?
This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.
QuantiusBenignus/BlahST
https://github.com/QuantiusBenignus/BlahSTSpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.
-
Does anybody know of a better #speechToText alternative to this?
This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.
QuantiusBenignus/BlahST
https://github.com/QuantiusBenignus/BlahSTSpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.
-
Does anybody know of a better #speechToText alternative to this?
This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.
QuantiusBenignus/BlahST
https://github.com/QuantiusBenignus/BlahSTSpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.
-
Does anybody know of a better #speechToText alternative to this?
This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.
QuantiusBenignus/BlahST
https://github.com/QuantiusBenignus/BlahSTSpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.
-
🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP
🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
• Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility -
🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP
🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
• Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility -
🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP
🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
• Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility -
🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP
🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
• Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility -
🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP
🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
• Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility -
@itsfoss Well, it's probably better to have #WhisperCpp integrated in #Shotcut than to wait until audio exports just to put it through AI externally again.
-
@itsfoss Well, it's probably better to have #WhisperCpp integrated in #Shotcut than to wait until audio exports just to put it through AI externally again.
-
@itsfoss Well, it's probably better to have #WhisperCpp integrated in #Shotcut than to wait until audio exports just to put it through AI externally again.
-
@itsfoss Well, it's probably better to have #WhisperCpp integrated in #Shotcut than to wait until audio exports just to put it through AI externally again.
-
@itsfoss Well, it's probably better to have #WhisperCpp integrated in #Shotcut than to wait until audio exports just to put it through AI externally again.
-
Russian talk radio:
[00:00:00.000 --> 00:00:06.140] Кто должен задать эти новые, что такое хорошо и что такое плохо? Государство?
[00:00:06.140 --> 00:00:16.060] Я думаю, ну, какая-то государственная комиссия, ну, такая реальная комиссия, реальная, которая готова заглянуть в будущее.
[00:00:16.060 --> 00:00:18.500] Кого мы хотим сейчас воспитать?
[00:00:18.500 --> 00:00:23.140] Кого мы хотим воспитать? Я не очень понимаю. -
Russian talk radio:
[00:00:00.000 --> 00:00:06.140] Кто должен задать эти новые, что такое хорошо и что такое плохо? Государство?
[00:00:06.140 --> 00:00:16.060] Я думаю, ну, какая-то государственная комиссия, ну, такая реальная комиссия, реальная, которая готова заглянуть в будущее.
[00:00:16.060 --> 00:00:18.500] Кого мы хотим сейчас воспитать?
[00:00:18.500 --> 00:00:23.140] Кого мы хотим воспитать? Я не очень понимаю. -
Russian talk radio:
[00:00:00.000 --> 00:00:06.140] Кто должен задать эти новые, что такое хорошо и что такое плохо? Государство?
[00:00:06.140 --> 00:00:16.060] Я думаю, ну, какая-то государственная комиссия, ну, такая реальная комиссия, реальная, которая готова заглянуть в будущее.
[00:00:16.060 --> 00:00:18.500] Кого мы хотим сейчас воспитать?
[00:00:18.500 --> 00:00:23.140] Кого мы хотим воспитать? Я не очень понимаю. -
Russian talk radio:
[00:00:00.000 --> 00:00:06.140] Кто должен задать эти новые, что такое хорошо и что такое плохо? Государство?
[00:00:06.140 --> 00:00:16.060] Я думаю, ну, какая-то государственная комиссия, ну, такая реальная комиссия, реальная, которая готова заглянуть в будущее.
[00:00:16.060 --> 00:00:18.500] Кого мы хотим сейчас воспитать?
[00:00:18.500 --> 00:00:23.140] Кого мы хотим воспитать? Я не очень понимаю. -
@bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.
-
@bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.
-
@bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.
-
@bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.
-
@bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.
-
Experimented with #whispercpp today. Results are quite impressive. I let it transcribe a 3 min snippet of an interview in German. What I noticed: -medium works significantly better than the smaller models. But it smoothens the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by a magnitude. Having everything local is a huge plus for sensitive data.
-
Experimented with #whispercpp today. Results are quite impressive. I let it transcribe a 3 min snippet of an interview in German. What I noticed: -medium works significantly better than the smaller models. But it smoothens the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by a magnitude. Having everything local is a huge plus for sensitive data.
-
Experimented with #whispercpp today. Results are quite impressive. I let it transcribe a 3 min snippet of an interview in German. What I noticed: -medium works significantly better than the smaller models. But it smoothens the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by a magnitude. Having everything local is a huge plus for sensitive data.
-
Experimented with #whispercpp today. Results are quite impressive. I let it transcribe a 3 min snippet of an interview in German. What I noticed: -medium works significantly better than the smaller models. But it smoothens the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by a magnitude. Having everything local is a huge plus for sensitive data.
-
Experimented with #whispercpp today. Results are quite impressive. I let it transcribe a 3 min snippet of an interview in German. What I noticed: -medium works significantly better than the smaller models. But it smoothens the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by a magnitude. Having everything local is a huge plus for sensitive data.
-
Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.
This does absolutely no favors to people actually _depending_ on those captions to be accurate.
-
Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.
This does absolutely no favors to people actually _depending_ on those captions to be accurate.
-
Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.
This does absolutely no favors to people actually _depending_ on those captions to be accurate.
-
Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.
This does absolutely no favors to people actually _depending_ on those captions to be accurate.
-
Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.
This does absolutely no favors to people actually _depending_ on those captions to be accurate.
-
what happens when you get whisper.cpp to listen to #chiptunes?
both speak at 16KHz so they should understand each other, right?
https://github.com/ggerganov/whisper.cpp
Track: Alpha by @lukhash
-
what happens when you get whisper.cpp to listen to #chiptunes?
both speak at 16KHz so they should understand each other, right?
https://github.com/ggerganov/whisper.cpp
Track: Alpha by @lukhash
-
what happens when you get whisper.cpp to listen to #chiptunes?
both speak at 16KHz so they should understand each other, right?
https://github.com/ggerganov/whisper.cpp
Track: Alpha by @lukhash
-
what happens when you get whisper.cpp to listen to #chiptunes?
both speak at 16KHz so they should understand each other, right?
https://github.com/ggerganov/whisper.cpp
Track: Alpha by @lukhash
-
Wonder how much € would be saved if people only knew about free and/or open source solutions. #whisperai #whispercpp #subtitleedit