Replies: 2 comments
I had the same question too, although the collected data does not seem to be open to the public.
Unfortunately no. Even the training code isn't public (they only provide fine-tuning code).
Thank you for the amazing, open-source Whisper 👍
As I understand it from the paper, you collected the training data from the internet and then did some preprocessing on it (re-sampling, ...). Is this training data publicly available?
I need it for sparsifying the Whisper model using sparseml. Of course, I could use the standard datasets that are used to evaluate Whisper, but Whisper's robustness comes from training on a diverse dataset.