Performance metrics for evaluating time segmentation of Whisper models #2118
Replies: 2 comments 1 reply
-
most metrics prioritize word accuracy over timestamps accuracy but if u want to go down the rabbit hole, the keywords to google/chatgpt are something like "metrics for alignment of time series" |
Beta Was this translation helpful? Give feedback.
-
measuring timing accuracy ,are usually part of diarization evaluation , and the criteria is "diarization error rate" , and can be more complex than evaluate WER. they has reference to how they measure their success : https://pyannote.github.io/pyannote-metrics/reference.html |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello everyone,
I like to compare different Whisper models on my own datasets. My output looks like
start, end, text
Questions
Thanks
Beta Was this translation helpful? Give feedback.
All reactions