Why can't we do multilanguage forced aligment without loading a language-specific alignment model? #1353

empz · 2024-10-01T06:08:37Z

empz
Oct 1, 2024

I don't know much about ML but I was able to use the following tutorial to do force aligment on multilingual transcription. The only requirement is to romanize the transcript which I did with the uroman package.
https://pytorch.org/audio/stable/tutorials/forced_alignment_for_multilingual_data_tutorial.html

According to that tutorial, it uses the Wav2Vec2 model to do this and I successfully aligned multiple languages. There's an extra step involved in mapping the aligned words back to the original word (non-romanized), but that's pretty much it.

Thoughts?

LostnD · 2024-10-10T20:35:41Z

LostnD
Oct 10, 2024

which model did you used can you tell me how to do this I wanna do it for Japanese language, because none of the japanese wav2vec2 I found working the english one works best, so it would be helpful if you share how did you used the multilingual one.

0 replies

MahmoudAshraf97 · 2024-10-14T12:01:07Z

MahmoudAshraf97
Oct 14, 2024

You can check https://github.com/MahmoudAshraf97/ctc-forced-aligner

0 replies

DushanthaS · 2025-05-07T13:56:45Z

DushanthaS
May 7, 2025

@empz I was trying to follow this, but it's not working. I am also trying to do multilingual transcription. Could you share a gist? Thank you!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Why can't we do multilanguage forced aligment without loading a language-specific alignment model? #1353

Uh oh!

{{title}}

Uh oh!

Replies: 3 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

Why can't we do multilanguage forced aligment without loading a language-specific alignment model? #1353

Uh oh!

empz Oct 1, 2024

Replies: 3 comments

Uh oh!

LostnD Oct 10, 2024

Uh oh!

MahmoudAshraf97 Oct 14, 2024

Uh oh!

DushanthaS May 7, 2025

empz
Oct 1, 2024

LostnD
Oct 10, 2024

MahmoudAshraf97
Oct 14, 2024

DushanthaS
May 7, 2025