Transcribing uncommon words with whisper #2169

OthmanePreure · 2024-05-09T13:35:51Z

OthmanePreure
May 9, 2024

I'm transcribing railway related audio files, however the model gets most of the Time the name of the train stations wrong
I tried fine tuning whisper with audio files (2 hours) but the results wasn't good enough. then i tried with prompt but the size of the context is limited to 244 token which in note sufficient regarding the important number of stations.

Any tips to solve this issue (maybe using some methods with post processing)

glangford · 2024-05-09T15:44:11Z

glangford
May 9, 2024

Post-processing with soundex or metaphone or some other form of fuzzy matching? eg. checking Whisper generated spellings against a dictionary of properly spelled station names

Phonetics based Fuzzy string matching algorithms - A dive in Soundex & Metaphone

https://medium.com/data-science-in-your-pocket/phonetics-based-fuzzy-string-matching-algorithms-8399aea04718

0 replies

rahulbansal16 · 2025-03-26T04:45:18Z

rahulbansal16
Mar 26, 2025

@OthmanePreure Did you try the phonetics-based matching?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Transcribing uncommon words with whisper #2169

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Transcribing uncommon words with whisper #2169

Uh oh!

OthmanePreure May 9, 2024

Replies: 2 comments

Uh oh!

Uh oh!

glangford May 9, 2024

Phonetics based Fuzzy string matching algorithms - A dive in Soundex & Metaphone

Uh oh!

rahulbansal16 Mar 26, 2025

OthmanePreure
May 9, 2024

glangford
May 9, 2024

rahulbansal16
Mar 26, 2025