I ran CrisperWhisper on a folder of 120 audio files, using a for-loop to transcribe each file one at a time (i.e. a batch size of 1). Five of the files failed with the following error message: The expanded size of the tensor (4) must match the existing size (5) at non-singleton dimension 0. Target sizes: [4]. Tensor sizes: [5]
I can confirm that the failing audio files (which I can't share for PHI reasons) aren't silent. Four of the five have significant background noise. Please let me know if there is a fix for this.
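
In case it helps with reproducing this, the per-file loop has roughly the shape below. This is a minimal sketch rather than my exact script: `transcribe_file` is a hypothetical placeholder for whatever call runs CrisperWhisper on a single file (it is not part of the CrisperWhisper API), and I'm assuming the error surfaces as a Python `RuntimeError`, as PyTorch shape mismatches usually do. Catching the error per file lets the run finish and leaves a list of the files that failed.

```python
from pathlib import Path
from typing import Callable, Dict, Iterable, Tuple


def transcribe_all(
    paths: Iterable[Path],
    transcribe_file: Callable[[Path], str],  # hypothetical single-file transcriber
) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Transcribe files one at a time, recording failures instead of stopping."""
    results: Dict[str, str] = {}
    failures: Dict[str, str] = {}
    for path in paths:
        try:
            results[str(path)] = transcribe_file(path)
        except RuntimeError as exc:
            # Assumption: the tensor size mismatch is raised as a RuntimeError.
            failures[str(path)] = str(exc)
    return results, failures
```

Calling `transcribe_all(sorted(Path("audio").glob("*.wav")), transcribe_file)` (the folder name and extension are placeholders) returns the transcripts plus a mapping from each failing file to its error message.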