Can we provide model a fake input after every 30 second window so that it can always generate text with punctuation #1307
Unanswered
FurkanGozukara
asked this question in
Q&A
Replies: 1 comment 1 reply
-
Something like #1040 may be able to help Whisper to remember the initial prompt with long audio. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
What I mean is for example for this video
https://youtu.be/v6TBtyO5Sxg
This is the output it made
You see after a while punctuation is lost. When we provide initial prompt like this it significantly improves the punctuation. So perhaps it can be repeated after punctuation is lost during transcription?
--initial_prompt "Welcome to the Software Engineering Courses channel."
@jongwook @ryanheise @guillaumekln
Beta Was this translation helpful? Give feedback.
All reactions