How to Obtain Multiple Predictions for Short Single-Word Audio Using Whisper? #1848
Unanswered
GeorgeAkkarapong asked this question in Q&A
Replies: 1 comment
-
FYI, past discussions: |
-
How can I use Whisper to transcribe a single-word utterance from an audio file and get several alternative predictions for it, each with a confidence score? For example, if the audio contains the word 'cheap', I would like to receive predictions such as ['sheep', 'cheap', 'cheese'] with confidence scores such as [0.91, 0.88, 0.57].
I know that a custom classification model could solve this, but I prefer Whisper because of the extensive vocabulary (49,000 words) of its pretrained large model.
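One way this could be approached, sketched here under some assumptions: decode the same clip several times with temperature sampling, then collapse the sampled transcripts into unique words ranked by their average log-probability. The helper names `rank_alternatives` and `sample_alternatives`, the `"base"` model, and the choice of 20 samples are illustrative, not part of Whisper. The score used, `exp(avg_logprob)`, is only a rough, uncalibrated confidence and not a true per-word probability.

```python
import math
import string


def rank_alternatives(hypotheses):
    """Collapse (text, avg_logprob) pairs into unique words ranked by score.

    The score is exp(avg_logprob), i.e. a geometric-mean token probability.
    It is a rough, uncalibrated confidence, not a true word posterior.
    """
    best = {}
    for text, avg_logprob in hypotheses:
        # Normalise case, whitespace and surrounding punctuation.
        word = text.strip().lower().strip(string.punctuation + " ")
        # Keep the highest log-probability seen for each distinct word.
        if word and avg_logprob > best.get(word, -math.inf):
            best[word] = avg_logprob
    ranked = sorted(best.items(), key=lambda kv: kv[1], reverse=True)
    return [(word, math.exp(lp)) for word, lp in ranked]


def sample_alternatives(path, model_name="base", n_samples=20):
    """Decode one short clip n_samples times with temperature sampling.

    Hypothetical helper; assumes the openai-whisper package and English audio.
    """
    import whisper  # pip install openai-whisper; imported lazily

    model = whisper.load_model(model_name)
    audio = whisper.pad_or_trim(whisper.load_audio(path))
    mel = whisper.log_mel_spectrogram(audio, n_mels=model.dims.n_mels).to(model.device)
    options = whisper.DecodingOptions(
        language="en", temperature=1.0, without_timestamps=True, fp16=False
    )
    hypotheses = []
    for _ in range(n_samples):
        # Each call draws a fresh sample because temperature > 0.
        result = whisper.decode(model, mel, options)
        hypotheses.append((result.text, result.avg_logprob))
    return rank_alternatives(hypotheses)
```

Calling `sample_alternatives("cheap.wav")` (a hypothetical file name) would return a list of `(word, score)` pairs sorted from most to least likely. Sampling is used here because `whisper.decode` returns only the single best result even when `beam_size` or `best_of` is set, so getting a true n-best list from beam search would require modifying the decoder.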