Automatic caption correction #1976
Closed
anton-codes
started this conversation in
General
Replies: 1 comment 1 reply
-
@anton-codes |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I'm using whisper to generate .vtt files for the .wav files that are generated by a TTS engine. I have the actual text that's being spoken by the TTS, I just need to use the timings to create the subtitles.
Challenge: Identify incorrect transcription and fix the mismatches with original text while maintaining time-code alignment.
Am I in the completely uncharted territory here? Anyone know any helpful NLP libraries or perhaps there is something within whisper?
example original.txt
well, they will be gone soon
example post.vtt
WEBVTT
00:00.000 --> 00:00.620
well, they are
00:00.620 --> 00:01.020
be gone sun
Beta Was this translation helpful? Give feedback.
All reactions