Replies: 7 comments 12 replies
-
Hi. |
Beta Was this translation helpful? Give feedback.
-
I read somewhere that to fine tune on this task you can follow this guide fine tuning whisper and just change the dataset and set translation as task instead of transcribe. I need to do the same but I still didn't tried. My worry is the calculation of WER, how can the wer be calculated in the task of translation? There are a lot of possibilities that the text predicted will have the same meaning as the real text but with differents and then the WER will mislead. Did you manage it? |
Beta Was this translation helpful? Give feedback.
-
@emanueleielo For evaluation, WER isn't a good metric for translation. You want to use one of the translation metrics like BLEU Score, METEOR, COMET or similar. Hope that helps! |
Beta Was this translation helpful? Give feedback.
-
Hi @AmgadHasan |
Beta Was this translation helpful? Give feedback.
-
Hi, @AmgadHasan , can you share how you prepare custom data? For example, let's say language pair is en->hi.
would be
right? |
Beta Was this translation helpful? Give feedback.
-
I worked on a project where I fine-tuned Whisper-Tiny for translation tasks, and it worked well. You can check out my repo to see how I did it, and it might help you fix the issue you're having. I have mentioned all the steps in README.md. |
Beta Was this translation helpful? Give feedback.
-
Hello, I’m currently working on a translation project involving Wolof, an underrepresented African language. I have a dataset consisting of audio files in Wolof and their corresponding translations in French. My question is: I would like to confirm whether such a fine-tuning approach is technically valid with Whisper (e.g., using task="translate"), and whether others have had success with similar setups. Thank you very much for your support and insights 🙏 |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I have an audio dataset of specific domain in the Hindi language, and I want to enhance the whisper translation capabilities of my model. Currently, it can take non-English audio input and translate it into English text.
I understand how to fine-tune the whisper model for transcription tasks like writing same language text as in audio but I'm not sure how to fine-tune it specifically for cross-lingual translation when audio is in another language and we want to improve translation to English performance of whisper model. Could you provide guidance on how to fine-tune the model for this purpose or share any repo?
Beta Was this translation helpful? Give feedback.
All reactions