Automatic video editing with in-video spoken commands #569
vivekhaldar started this conversation in Show and tell
Replies: 1 comment 1 reply
Pretty cool!
Use case: talking-head videos where you know while recording which parts you want to keep and which parts you flubbed and want to drop.
I embed spoken commands while recording. I say "victor kilo" after a segment I want to keep and "victor delta" after a segment I want to drop. The specific command words are not important; you can pick your own.
A script then uses Whisper, modified with stable-ts (https://github.com/jianfch/stable-ts), to get word-level timestamps, finds the spoken commands, and outputs the final edited video.
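To illustrate the command-matching step, here is a minimal sketch. It is not the code from the gist. It assumes the transcription has already been turned into a list of `(text, start, end)` tuples with times in seconds; that input format, the function name `keep_segments`, and the rule that anything after the last command is dropped are all assumptions made for this example.

```python
import string

# Command phrases from the post; swap in your own words.
KEEP = ("victor", "kilo")
DROP = ("victor", "delta")


def normalize(word):
    """Lowercase a transcribed word and strip surrounding spaces and punctuation."""
    return word.strip().strip(string.punctuation).lower()


def keep_segments(words, keep=KEEP, drop=DROP):
    """Return (start, end) time ranges to keep.

    `words` is a list of (text, start, end) tuples in spoken order
    (hypothetical format, assumed for this sketch). A segment runs from
    the end of the previous command to the start of the next command.
    Speech after the last command is dropped (an assumption).
    """
    segments = []
    seg_start = 0.0
    i = 0
    while i < len(words) - 1:
        pair = (normalize(words[i][0]), normalize(words[i + 1][0]))
        if pair == keep or pair == drop:
            cmd_start = words[i][1]      # start of the first command word
            cmd_end = words[i + 1][2]    # end of the second command word
            if pair == keep and cmd_start > seg_start:
                segments.append((seg_start, cmd_start))
            seg_start = cmd_end          # next segment begins after the command
            i += 2
        else:
            i += 1
    return segments


if __name__ == "__main__":
    demo = [
        ("Hello", 0.0, 0.5), ("world", 0.5, 1.0),
        ("victor", 1.2, 1.6), ("kilo.", 1.6, 2.0),
        ("Oops", 2.5, 3.0),
        ("Victor", 3.1, 3.5), ("delta", 3.5, 4.0),
        ("Bye", 4.2, 4.8),
        ("victor", 5.0, 5.3), ("kilo", 5.3, 5.7),
    ]
    print(keep_segments(demo))  # [(0.0, 1.2), (4.0, 5.0)]
```

The returned ranges exclude the spoken commands themselves, so they can be handed to whatever tool does the actual cutting and concatenation.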
Full code: https://gist.github.com/vivekhaldar/63764b085640bd82ebf07482f8a58cb8
Video that explains it, along with a short demo: https://youtu.be/SmWpi-jeAEU