Can whisper-large-v2 be fine-tuned with the data augmentation techniques SpecAugment, Stochastic Depth, and BPE Dropout? #1547
Unanswered
maowandong asked this question in Q&A
Replies: 2 comments
-
Continuous attention
-
Yes, too bad the implementations have not been shared.
-
I'm fine-tuning Whisper and noticed that whisper-large-v2, compared to the other Whisper checkpoints, was trained for 2.5x longer, with SpecAugment, Stochastic Depth, and BPE Dropout added as regularization. Doesn't training 2.5x longer risk overfitting? Are these augmentation/regularization techniques also suitable for fine-tuning, and if so, are there any recommended open-source implementations I can refer to?
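Since the question asks for open-source implementations: OpenAI did not release its augmentation code, but if you fine-tune with Hugging Face transformers, I believe recent versions expose SpecAugment for Whisper through WhisperConfig options such as apply_spec_augment and mask_time_prob, and the tokenizers library's BPE model accepts a dropout argument implementing BPE Dropout (worth verifying against the current docs). Below is a minimal, self-contained sketch in plain PyTorch of the other two ideas: SpecAugment-style time/frequency masking on Whisper's 80-bin log-mel inputs and a stochastic-depth wrapper for residual blocks. Every hyperparameter in it is a placeholder value, not what was used to train large-v2.

```python
# Illustrative sketch only -- NOT the (unreleased) implementation used to train
# whisper-large-v2; every hyperparameter below is a placeholder example value.
import torch
import torch.nn as nn


def spec_augment(mel: torch.Tensor,
                 n_time_masks: int = 2, time_mask_width: int = 50,
                 n_freq_masks: int = 2, freq_mask_width: int = 10) -> torch.Tensor:
    """SpecAugment-style masking on a batch of log-mel spectrograms.

    mel: (batch, n_mels, n_frames), e.g. (B, 80, 3000) for 30 s Whisper inputs.
    Masked regions are filled with the per-example mean, a common choice.
    """
    mel = mel.clone()
    batch, n_mels, n_frames = mel.shape
    fill = mel.mean(dim=(1, 2), keepdim=True)           # (B, 1, 1)
    for b in range(batch):
        for _ in range(n_time_masks):                    # mask random time spans
            w = int(torch.randint(0, time_mask_width + 1, (1,)))
            t0 = int(torch.randint(0, max(1, n_frames - w), (1,)))
            mel[b, :, t0:t0 + w] = fill[b]
        for _ in range(n_freq_masks):                    # mask random mel bands
            w = int(torch.randint(0, freq_mask_width + 1, (1,)))
            f0 = int(torch.randint(0, max(1, n_mels - w), (1,)))
            mel[b, f0:f0 + w, :] = fill[b]
    return mel


class StochasticDepth(nn.Module):
    """Skip a residual branch with probability p_drop during training.

    Assumes the wrapped module computes only the residual function, i.e. the
    caller would normally do `x + block(x)`. At eval time the block always runs.
    """

    def __init__(self, block: nn.Module, p_drop: float = 0.1):
        super().__init__()
        self.block = block
        self.p_drop = p_drop

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if not self.training or self.p_drop == 0.0:
            return x + self.block(x)
        if torch.rand(1).item() < self.p_drop:
            return x                                     # drop the branch this step
        # Scale the surviving branch so its expected contribution matches eval.
        return x + self.block(x) / (1.0 - self.p_drop)
```

Note that the drop decision above is made once per batch for simplicity; per-sample variants mask each example independently. Also, Whisper's encoder keeps its residual connections inside each attention block, so in practice stochastic depth means editing the block's forward rather than wrapping the block from outside as this sketch does.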