Fine-tuning Whisper Large #1247

DonDuckDuck · 2023-04-17T08:51:15Z

DonDuckDuck
Apr 17, 2023

Hi, guys, I'm trying fine-tune whisper large's performance on mandarian with huggingface transformer.

Each of my sample is about 6~7mb and I'm fine tuning it with 4 v100s, even though I set batch size to 1, pytorch still throw me oom error.

It also doesn't work when I trying to run it on a single 3090 24GB GPU with 1 batch size.

Any suggestion to get it running:)?

Answered by jongwook

May 5, 2023

The audio and the labels should be segmented into 30s or shorter chunks, to match the training distribution. I guess it should make the memory usage low enough. Using mixed precision and gradient checkpointing may further reduce the memory usage during fine-tuning.

View full answer

jongwook · 2023-05-05T09:30:46Z

jongwook
May 5, 2023
Maintainer

The audio and the labels should be segmented into 30s or shorter chunks, to match the training distribution. I guess it should make the memory usage low enough. Using mixed precision and gradient checkpointing may further reduce the memory usage during fine-tuning.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fine-tuning Whisper Large #1247

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Fine-tuning Whisper Large #1247

Uh oh!

Uh oh!

DonDuckDuck Apr 17, 2023

Replies: 1 comment

Uh oh!

jongwook May 5, 2023 Maintainer

DonDuckDuck
Apr 17, 2023

jongwook
May 5, 2023
Maintainer