Real time live ASR? #925
-
Hey, I've tried implementing ASR chunking with overlaps myself to get interim results for near-real-time transcription. The good news is that it's definitely possible, but the results aren't always great. I built a sliding window to make this work: essentially, I concatenated the current recording with a portion of the previous recording. Unfortunately, this method often cuts sentences off prematurely, which in my case can completely change the meaning of the speech being transcribed. If you manage to make this work better, I'd love to hear about it.
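For anyone who wants to try the sliding-window idea, here is a minimal sketch of it, not the exact code from this comment. It assumes audio arrives as fixed chunks of samples, and the names `OverlapWindow`, `stream_transcribe`, and the `transcribe` callable are hypothetical placeholders; `transcribe` stands in for whatever model call you use.

```python
from typing import Callable, Iterable, Iterator, List, Sequence


class OverlapWindow:
    """Prepends the tail of the previous chunk to each new chunk.

    Hypothetical helper for illustration only.
    """

    def __init__(self, overlap_samples: int) -> None:
        if overlap_samples < 0:
            raise ValueError("overlap_samples must be non-negative")
        self.overlap_samples = overlap_samples
        self._tail: List[float] = []  # last samples of the previous chunk

    def push(self, chunk: Sequence[float]) -> List[float]:
        """Return previous tail + current chunk, then remember the new tail."""
        current = list(chunk)
        window = self._tail + current
        # Guard the zero case: current[-0:] would return the whole chunk.
        if self.overlap_samples > 0:
            self._tail = current[-self.overlap_samples:]
        else:
            self._tail = []
        return window


def stream_transcribe(
    chunks: Iterable[Sequence[float]],
    transcribe: Callable[[List[float]], str],
    overlap_samples: int,
) -> Iterator[str]:
    """Yield one interim result per incoming chunk.

    `transcribe` is an assumed callable that maps samples to text.
    """
    window = OverlapWindow(overlap_samples)
    for chunk in chunks:
        yield transcribe(window.push(chunk))
```

With 16 kHz audio, `overlap_samples=8000` would carry half a second of context into each window. Note that the overlapped audio is transcribed twice, so consecutive interim results repeat words and still need to be merged or deduplicated.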
-
Hey, thank you for your reply! That is a nice approach, thank you for sharing!
-
#1978 -- Whisper-Streaming has a real-time implementation and supports the faster-whisper backend.
-
Hello, I am curious about how to implement ASR chunking for real-time transcription (with interim results, so to speak). I am aware of VAD implementations, but I wanted to ask whether chunking with overlaps might also be possible.
Thank you so much for helping me out!