Transcribing uncommon words with whisper #2169
Unanswered
OthmanePreure
asked this question in
Q&A
Replies: 2 comments
-
Post-processing with soundex or metaphone or some other form of fuzzy matching? eg. checking Whisper generated spellings against a dictionary of properly spelled station names Phonetics based Fuzzy string matching algorithms - A dive in Soundex & Metaphone |
Beta Was this translation helpful? Give feedback.
0 replies
-
@OthmanePreure Did you try the phonetics-based matching? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I'm transcribing railway related audio files, however the model gets most of the Time the name of the train stations wrong
I tried fine tuning whisper with audio files (2 hours) but the results wasn't good enough. then i tried with prompt but the size of the context is limited to 244 token which in note sufficient regarding the important number of stations.
Any tips to solve this issue (maybe using some methods with post processing)
Beta Was this translation helpful? Give feedback.
All reactions