Replies: 1 comment
-
It would be better to use a dedicated voice activity detector, such as Silero VAD. The no_speech probability in whisper is not very accurate...but if you want to use just whisper, see this discussion |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi, I am only using Whisper to detect the langauge using that
but it only returns probabilities, that the given 30 sec chunk of audio has the specific language, but it does not return the no_speech probability. Is there any way to only return the no_speech probability, without using .transcribe, which is computationally expensive?
Beta Was this translation helpful? Give feedback.
All reactions