Comparing transcriptions #272
Joshfindit
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This is quite a minor one, but it might inspire people to create more powerful tools:
To compare the transcription between models I was able to get meaningful results by:
diff
on the two filesOutput will be like:
If doing a simpler compare, all text can be normalized to lower case before comparison.
This gives a nice readable
diff
output that can be scanned through where gaps are immediately obvious as are words that are transcribed differently for each model.If someone wants to lead the project, I'd help build a tool that runs multiple models and lets the user choose the best option for each word/sentence, for those use cases where correctness is more important.
Beta Was this translation helpful? Give feedback.
All reactions