sentence level language detection #2023

josephwong14wkh · 2024-02-16T03:45:29Z

Does anyone have an idea for sentence-level language detection? I am currently trying to fine-tune Whisper and hope that it can detect multiple languages within a single audio clip. For example, given a 10 s clip with two speakers, if one speaker speaks English for the first 5 s and another speaker speaks Mandarin for the next 5 s, I want Whisper to detect "en" for the first 5 s and "zh" for the next 5 s.

glangford · 2024-02-18T14:35:14Z

See the ongoing discussion and example code here:

Multi-Language Audio and Transcription Inconsistencies #2009

gyllila · 2024-02-22T15:05:42Z

I’m afraid you’ll have to split the audio into sentences yourself, save them as separate files, and then run Whisper on each one. It’s not difficult to write a small script to do this routinely. To split on sentence boundaries, you can use the "Analyze > Silence Finder" function in Audacity or the Python library pydub.
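
The split-then-detect approach described above could be sketched roughly as follows. This is only an illustration under some assumptions: `merge_spans` and `detect_spans` are hypothetical helper names, the silence thresholds and the `chunk_{i}.wav` file names are arbitrary guesses, and the `base` model is just a placeholder. It relies on pydub's `detect_nonsilent` and the `detect_language` call shown in the Whisper README, and both need ffmpeg installed.

```python
from typing import List, Tuple

Span = Tuple[int, int, str]  # (start_ms, end_ms, language code)


def merge_spans(spans: List[Span]) -> List[Span]:
    """Merge consecutive spans that share the same language label."""
    merged: List[Span] = []
    for start, end, lang in spans:
        if merged and merged[-1][2] == lang:
            # Same language as the previous span: extend it.
            merged[-1] = (merged[-1][0], end, lang)
        else:
            merged.append((start, end, lang))
    return merged


def detect_spans(path: str, model_name: str = "base") -> List[Span]:
    """Split `path` on silence and detect the language of each chunk."""
    # Imported lazily so merge_spans can be used without these packages.
    import whisper
    from pydub import AudioSegment
    from pydub.silence import detect_nonsilent

    model = whisper.load_model(model_name)
    audio = AudioSegment.from_file(path)

    # Assumed thresholds; tune them for your own recordings.
    ranges = detect_nonsilent(
        audio, min_silence_len=500, silence_thresh=audio.dBFS - 16
    )

    spans: List[Span] = []
    for i, (start, end) in enumerate(ranges):
        chunk_path = f"chunk_{i}.wav"  # hypothetical output name
        audio[start:end].export(chunk_path, format="wav")

        # Language detection as in the Whisper README, applied per chunk.
        samples = whisper.pad_or_trim(whisper.load_audio(chunk_path))
        mel = whisper.log_mel_spectrogram(
            samples, n_mels=model.dims.n_mels
        ).to(model.device)
        _, probs = model.detect_language(mel)
        spans.append((start, end, max(probs, key=probs.get)))

    return merge_spans(spans)
```

Calling `detect_spans("meeting.wav")` (a placeholder file name) would return a list of `(start_ms, end_ms, language)` tuples. Detection on very short chunks may be unreliable, so it can help to raise `min_silence_len` or to merge neighbouring chunks before detection.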
