Performance metrics for evaluating time segmentation of Whisper models #2118

mafaisalpg · 2024-04-04T20:56:50Z

mafaisalpg
Apr 4, 2024

Hello everyone,

I like to compare different Whisper models on my own datasets. My output looks like
start, end, text

Questions

What are metrics (MSE, MAE, etc.) best fit for this purpose?
Is there any Python implementation which I can reuse?
Is there any paper or article have done such comparison?

Thanks

phineas-pta · 2024-04-05T00:01:08Z

phineas-pta
Apr 5, 2024

most metrics prioritize word accuracy over timestamps accuracy

but if u want to go down the rabbit hole, the keywords to google/chatgpt are something like "metrics for alignment of time series"

0 replies

itaipee · 2024-04-07T11:24:31Z

itaipee
Apr 7, 2024

measuring timing accuracy ,are usually part of diarization evaluation , and the criteria is "diarization error rate" , and can be more complex than evaluate WER.
there is good package about diarization :
https://github.com/pyannote/pyannote-audio?tab=readme-ov-file

they has reference to how they measure their success : https://pyannote.github.io/pyannote-metrics/reference.html

1 reply

mafaisalpg Apr 8, 2024
Author

thanks. however, I am struggling to change my vad annotation file (start, end, ) to pyannote-audio compatible one for "Detection Error Rate". any suggestion on this?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Performance metrics for evaluating time segmentation of Whisper models #2118

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Select a reply

Uh oh!

Performance metrics for evaluating time segmentation of Whisper models #2118

Uh oh!

mafaisalpg Apr 4, 2024

Replies: 2 comments · 1 reply

Uh oh!

phineas-pta Apr 5, 2024

Uh oh!

itaipee Apr 7, 2024

Uh oh!

Uh oh!

mafaisalpg Apr 8, 2024 Author

mafaisalpg
Apr 4, 2024

Replies: 2 comments 1 reply

phineas-pta
Apr 5, 2024

itaipee
Apr 7, 2024

mafaisalpg Apr 8, 2024
Author