Replies: 2 comments
-
Ending up putting something together that converts Whisper word-level generated SRT's to segment-level SRT's by condensing sentences down to one segment and reassigning start and end timestamps in accordance to first and last words. When running Whisper with the
Install dart, or run compiled in |
Beta Was this translation helpful? Give feedback.
-
Yeah, I kind of thought that if whisper was outputting an end time, it would be the actual end time of the speech, not... the beginning of the next speech. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Currently, we get continuous/gapless segments where segments are hard "butted up" against each other (indicative by the end-time of the previous and start-time of the proceeding segment (
1 ends at 00:00:09,600 while 2 starts at 00:00:09,600 but in fact, there is a gap between them
).Not sure if it's work in progress but would love to see segments start/end times matching with speech segment lengths that are true to the respective speech segments.
On the other hand, being able to specify a duration that; if the time between each segment is less than
x
specified time, butt them up against each other. WRT subtitles, this would prevent "flashing" subtitles where the gap between them is so short that we should probably just extend it out to the next one.Beta Was this translation helpful? Give feedback.
All reactions