-
I don't have any experience with fine-tuning, but here are my thoughts. Suppose the transcript is "abc xyz". In the "Load WhisperTokenizer" section of the fine-tune guide, the transcript must be tokenized into something like `<|startoftranscript|><|en|><|transcribe|><|notimestamps|>abc xyz<|endoftext|>`. After digging into the source files, it seems a prompt is simply prepended in front of that, e.g. `<|startofprev|>lorem ipsum<|startoftranscript|>…`, so I guess you have to find a way with the tokenizer to produce this sequence.
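Something like this might work (just a sketch based on reading the tokenizer source; `get_prompt_ids` exists in recent transformers versions, and the prompt/transcript strings here are made up):

```python
# Sketch: build a prompted label sequence with WhisperTokenizer.
# "lorem ipsum" / "abc xyz" are made-up prompt and transcript strings.
from transformers import WhisperTokenizer

tokenizer = WhisperTokenizer.from_pretrained(
    "openai/whisper-small", language="english", task="transcribe"
)

# <|startofprev|> + prompt tokens
prompt_ids = tokenizer.get_prompt_ids("lorem ipsum", return_tensors="np").tolist()

# <|startoftranscript|><|en|><|transcribe|><|notimestamps|> + transcript + <|endoftext|>
label_ids = tokenizer("abc xyz").input_ids

labels = prompt_ids + label_ids
print(tokenizer.decode(labels))  # shows the full prompted sequence
```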
-
@phineas-pta Thanks for your response. What should be the …
-
Hi All,
How are you?
I'm trying to fine-tune Whisper by resuming its pre-training task and adding initial prompts as part of the model's forward pass. I saw this amazing tutorial; however, it does not have a section on using prompts as part of the fine-tuning dataset.
My motivation is that Whisper does not behave as expected when transcribing with prompts: sometimes the output is blank text, and on other occasions it contains repetitions. We want to fix these behaviors by fine-tuning Whisper with prompts.
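To make this concrete, here is a rough sketch of the kind of training example I have in mind (my own guess at one possible approach, not something from the tutorial: the prompt stays in decoder_input_ids so the model conditions on it, while -100 masks it out of the loss):

```python
# Rough sketch: condition on the prompt but take no loss on it.
# Assumes prompt_ids starts with <|startofprev|> and label_ids is the
# usual <|startoftranscript|> ... <|endoftext|> sequence from the tokenizer.
import torch

def build_prompted_example(prompt_ids: list, label_ids: list):
    full = prompt_ids + label_ids
    decoder_input_ids = torch.tensor(full[:-1])  # teacher-forcing input
    labels = torch.tensor(full[1:])              # next-token targets
    # mask the targets inside the prompt region (everything before
    # <|startoftranscript|>) so the prompt contributes no loss
    labels[: len(prompt_ids) - 1] = -100
    return decoder_input_ids, labels
```

As far as I can tell from the source, if decoder_input_ids is passed alongside labels, WhisperForConditionalGeneration uses it directly instead of shifting labels right; that matters here, because the shift would replace the -100 positions with padding and the model would never see the prompt.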
Any help will be appreciated!