home.social

#whispercpp — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #whispercpp, aggregated by home.social.

  1. Have the practice of bookmarking content for future processing and currently working on a script that uses various services to hijack endpoints via #curl. The content is hosted on #Instagram as reels.

    One service downloads the reel while the other transcribes it.

    Now that the transcription service has a daily limit, I am wondering which approach I will take to overcome this obstacle.

    Either one can #SOCKS5 through curl onto the #Tor network to create a new connection after hitting the daily limit again.

    Or one can #whisperCpp over the downloaded reel.

  2. Have the practice of bookmarking content for future processing and currently working on a script that uses various services to hijack endpoints via #curl. The content is hosted on #Instagram as reels.

    One service downloads the reel while the other transcribes it.

    Now that the transcription service has a daily limit, I am wondering which approach I will take to overcome this obstacle.

    Either one can #SOCKS5 through curl onto the #Tor network to create a new connection after hitting the daily limit again.

    Or one can #whisperCpp over the downloaded reel.

  3. Have the practice of bookmarking content for future processing and currently working on a script that uses various services to hijack endpoints via #curl. The content is hosted on #Instagram as reels.

    One service downloads the reel while the other transcribes it.

    Now that the transcription service has a daily limit, I am wondering which approach I will take to overcome this obstacle.

    Either one can #SOCKS5 through curl onto the #Tor network to create a new connection after hitting the daily limit again.

    Or one can #whisperCpp over the downloaded reel.

  4. Have the practice of bookmarking content for future processing and currently working on a script that uses various services to hijack endpoints via #curl. The content is hosted on #Instagram as reels.

    One service downloads the reel while the other transcribes it.

    Now that the transcription service has a daily limit, I am wondering which approach I will take to overcome this obstacle.

    Either one can #SOCKS5 through curl onto the #Tor network to create a new connection after hitting the daily limit again.

    Or one can #whisperCpp over the downloaded reel.

  5. Whisper.cpp đã ra bản prototype dùng được: chuyển âm thanh sang văn bản locally (CPU/GPU), căn chỉnh từ‑từng‑từ đa ngôn ngữ, công cụ chỉnh sửa thủ công, giao diện editor mượt mà, xuất subtitle. Hoạt động offline, không phụ thuộc cloud, và dự định giữ miễn phí. Cần ý kiến về tính năng & giấy phép. #WhisperCPP #AI #Transcription #OpenSource #Vietnam #CôngCụ #FreeSoftware #TruyềnÂmThanh #AIđịaphương

    reddit.com/r/LocalLLaMA/commen

  6. Tôi đang phát triển app chuyển giọng nói thành văn bản dùng whisper.cpp + WAV2VEC2 cho đồng bộ thời gian cực chính xác (±10‑20 ms). Ứng dụng chạy locally trên CPU/GPU, xuất SRT, VTT, JSON, hỗ trợ đa ngôn ngữ. Cloud Groq chỉ ổn cho tiếng Anh, đa ngôn ngữ giảm độ chính xác. Bạn thích tốc độ nhanh (tiếng Anh) hay độ chính xác đa ngôn ngữ chậm hơn? Cần ý kiến! #AI #MachineLearning #Transcription #whispercpp #CôngNghệ #NhậnDạngGiọngNói #Vietnam

    reddit.com/r/LocalLLaMA/commen

  7. Does anybody know of a better #speechToText alternative to this?

    This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.

    QuantiusBenignus/BlahST
    github.com/QuantiusBenignus/Bl

    SpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.

    #STT #WhisperCPP

  8. Does anybody know of a better #speechToText alternative to this?

    This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.

    QuantiusBenignus/BlahST
    github.com/QuantiusBenignus/Bl

    SpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.

    #STT #WhisperCPP

  9. Does anybody know of a better #speechToText alternative to this?

    This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.

    QuantiusBenignus/BlahST
    github.com/QuantiusBenignus/Bl

    SpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.

    #STT #WhisperCPP

  10. Does anybody know of a better #speechToText alternative to this?

    This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.

    QuantiusBenignus/BlahST
    github.com/QuantiusBenignus/Bl

    SpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.

    #STT #WhisperCPP

  11. Does anybody know of a better #speechToText alternative to this?

    This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.

    QuantiusBenignus/BlahST
    github.com/QuantiusBenignus/Bl

    SpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.

    #STT #WhisperCPP

  12. 🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP

    🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
    • Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility

    github.com/CodeWithKyrian/whis

  13. 🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP

    🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
    • Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility

    github.com/CodeWithKyrian/whis

  14. 🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP

    🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
    • Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility

    github.com/CodeWithKyrian/whis

  15. 🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP

    🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
    • Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility

    github.com/CodeWithKyrian/whis

  16. 🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP

    🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
    • Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility

    github.com/CodeWithKyrian/whis

  17. @itsfoss Well, it's probably better to have #WhisperCpp integrated in #Shotcut than to wait until audio exports just to put it through AI externally again.

    #Whisper

  18. @itsfoss Well, it's probably better to have #WhisperCpp integrated in #Shotcut than to wait until audio exports just to put it through AI externally again.

    #Whisper

  19. @itsfoss Well, it's probably better to have #WhisperCpp integrated in #Shotcut than to wait until audio exports just to put it through AI externally again.

    #Whisper

  20. @itsfoss Well, it's probably better to have #WhisperCpp integrated in #Shotcut than to wait until audio exports just to put it through AI externally again.

    #Whisper

  21. @itsfoss Well, it's probably better to have integrated in than to wait until audio exports just to put it through AI externally again.

  22. Russian talk radio:

    [00:00:00.000 --> 00:00:06.140] Кто должен задать эти новые, что такое хорошо и что такое плохо? Государство?
    [00:00:06.140 --> 00:00:16.060] Я думаю, ну, какая-то государственная комиссия, ну, такая реальная комиссия, реальная, которая готова заглянуть в будущее.
    [00:00:16.060 --> 00:00:18.500] Кого мы хотим сейчас воспитать?
    [00:00:18.500 --> 00:00:23.140] Кого мы хотим воспитать? Я не очень понимаю.

    #whispercpp

  23. Russian talk radio:

    [00:00:00.000 --> 00:00:06.140] Кто должен задать эти новые, что такое хорошо и что такое плохо? Государство?
    [00:00:06.140 --> 00:00:16.060] Я думаю, ну, какая-то государственная комиссия, ну, такая реальная комиссия, реальная, которая готова заглянуть в будущее.
    [00:00:16.060 --> 00:00:18.500] Кого мы хотим сейчас воспитать?
    [00:00:18.500 --> 00:00:23.140] Кого мы хотим воспитать? Я не очень понимаю.

    #whispercpp

  24. Russian talk radio:

    [00:00:00.000 --> 00:00:06.140] Кто должен задать эти новые, что такое хорошо и что такое плохо? Государство?
    [00:00:06.140 --> 00:00:16.060] Я думаю, ну, какая-то государственная комиссия, ну, такая реальная комиссия, реальная, которая готова заглянуть в будущее.
    [00:00:16.060 --> 00:00:18.500] Кого мы хотим сейчас воспитать?
    [00:00:18.500 --> 00:00:23.140] Кого мы хотим воспитать? Я не очень понимаю.

    #whispercpp

  25. Russian talk radio:

    [00:00:00.000 --> 00:00:06.140] Кто должен задать эти новые, что такое хорошо и что такое плохо? Государство?
    [00:00:06.140 --> 00:00:16.060] Я думаю, ну, какая-то государственная комиссия, ну, такая реальная комиссия, реальная, которая готова заглянуть в будущее.
    [00:00:16.060 --> 00:00:18.500] Кого мы хотим сейчас воспитать?
    [00:00:18.500 --> 00:00:23.140] Кого мы хотим воспитать? Я не очень понимаю.

    #whispercpp

  26. @bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.

  27. @bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.

  28. @bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.

  29. @bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.

  30. @bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.

  31. Experimented with today. Results are quite impressive. I let it transcribe a 3 min snippet of an interview in German. What I noticed: -medium works significantly better than the smaller models. But it smoothens the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by a magnitude. Having everything local is a huge plus for sensitive data.

  32. Experimented with #whispercpp today. Results are quite impressive. I let it transcribe a 3 min snippet of an interview in German. What I noticed: -medium works significantly better than the smaller models. But it smoothens the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by a magnitude. Having everything local is a huge plus for sensitive data.

  33. Experimented with #whispercpp today. Results are quite impressive. I let it transcribe a 3 min snippet of an interview in German. What I noticed: -medium works significantly better than the smaller models. But it smoothens the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by a magnitude. Having everything local is a huge plus for sensitive data.

  34. Experimented with #whispercpp today. Results are quite impressive. I let it transcribe a 3 min snippet of an interview in German. What I noticed: -medium works significantly better than the smaller models. But it smoothens the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by a magnitude. Having everything local is a huge plus for sensitive data.

  35. Experimented with #whispercpp today. Results are quite impressive. I let it transcribe a 3 min snippet of an interview in German. What I noticed: -medium works significantly better than the smaller models. But it smoothens the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by a magnitude. Having everything local is a huge plus for sensitive data.

  36. Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.

    This does absolutely no favors to people actually _depending_ on those captions to be accurate.

    #ai #whispercpp #accessibility

  37. Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.

    This does absolutely no favors to people actually _depending_ on those captions to be accurate.

    #ai #whispercpp #accessibility

  38. Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.

    This does absolutely no favors to people actually _depending_ on those captions to be accurate.

    #ai #whispercpp #accessibility

  39. Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.

    This does absolutely no favors to people actually _depending_ on those captions to be accurate.

    #ai #whispercpp #accessibility

  40. Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.

    This does absolutely no favors to people actually _depending_ on those captions to be accurate.

    #ai #whispercpp #accessibility

  41. what happens when you get whisper.cpp to listen to #chiptunes?

    both speak at 16KHz so they should understand each other, right?

    github.com/ggerganov/whisper.c

    Track: Alpha by @lukhash

    #C64 #MOS6581 #Speech2Text #Whispercpp

  42. what happens when you get whisper.cpp to listen to #chiptunes?

    both speak at 16KHz so they should understand each other, right?

    github.com/ggerganov/whisper.c

    Track: Alpha by @lukhash

    #C64 #MOS6581 #Speech2Text #Whispercpp

  43. what happens when you get whisper.cpp to listen to #chiptunes?

    both speak at 16KHz so they should understand each other, right?

    github.com/ggerganov/whisper.c

    Track: Alpha by @lukhash

    #C64 #MOS6581 #Speech2Text #Whispercpp

  44. what happens when you get whisper.cpp to listen to #chiptunes?

    both speak at 16KHz so they should understand each other, right?

    github.com/ggerganov/whisper.c

    Track: Alpha by @lukhash

    #C64 #MOS6581 #Speech2Text #Whispercpp

  45. Wonder how much € would be saved if people only knew about free and/or open source solutions. #whisperai #whispercpp #subtitleedit