Skip to content
Discussion options

You must be logged in to vote

Hey @marthafikry! Cool to see that you're fine-tuning Whisper for Arabic!

The issue is with your target label sequences. Some of the label sequences have a length that exceeds the model’s maximum generation length. These must be very long sequences, as the maximum generation length is 448. This is the longest sequence the model is configured to handle (model.config.max_length).

We've got two options here:

  1. Filter any label sequences longer than max length
  2. Increase the models' max length

What we can do is compute the labels length of each target sequence:

def prepare_dataset(batch):
    # load and resample audio data from 48 to 16kHz
    audio = batch["audio"]

    # compute input length

Replies: 2 comments 5 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
5 replies
@marthafikry
Comment options

@huynhthanh98
Comment options

@huynhthanh98
Comment options

@sanchit-gandhi
Comment options

@ATISHAYS99
Comment options

Answer selected by jongwook
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
5 participants