How to Obtain Multiple Predictions for Short Single-Word Audio Using Whisper? #1848

GeorgeAkkarapong · 2023-11-28T16:09:19Z

GeorgeAkkarapong
Nov 28, 2023

How can I use Whisper to predict single-word utterances from an audio file and receive multiple alternative predictions for each input, along with corresponding confidence scores? For example, if the audio file contains the word 'cheap,' I want to receive multiple predictions like ['sheep', 'cheap', 'cheese'] with corresponding confidence scores like [0.91, 0.88, 0.57].

I acknowledge that a custom classification AI model could provide a solution, but I opt for Whisper due to its extensive vocabulary of 49,000 words in its pretrained large model.

glangford · 2023-11-28T16:34:08Z

glangford
Nov 28, 2023

FYI, past discussions:

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to Obtain Multiple Predictions for Short Single-Word Audio Using Whisper? #1848

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

How to Obtain Multiple Predictions for Short Single-Word Audio Using Whisper? #1848

Uh oh!

Uh oh!

GeorgeAkkarapong Nov 28, 2023

Replies: 1 comment

Uh oh!

glangford Nov 28, 2023

GeorgeAkkarapong
Nov 28, 2023

glangford
Nov 28, 2023