sentence level language detection #2023
josephwong14wkh
started this conversation in
General
Replies: 2 comments
-
See the ongoing discussion and example code here |
Beta Was this translation helpful? Give feedback.
0 replies
-
I’m afraid you’ll have to split the audio in sentences yourself, save in separate files, then use Whisper for each. It’s not difficult to write a small script and do this routinely. For splitting in sentences, you can use the "Analyze/Stille-Finder" function in Audacity, or the python library pydub. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Did anyone has idea on sentence-level language detection? I am currently trying to finetune whisper and hope that it can detect multiple languages given an audio. For example, given a 10s audio with two speakers , if one speaker speaks english in first 5s and another speaker speaks Mandarin in the next 5s, i want whisper to detect "en" for the first 5s and "zh" for the next 5s.
Beta Was this translation helpful? Give feedback.
All reactions