Can I use Whisper to output a set of phonemes with timings? #1917
blackears
started this conversation in
Show and tell
Replies: 2 comments 5 replies
-
word timestamp YES |
Beta Was this translation helpful? Give feedback.
0 replies
-
I found the Also, I had thought that phonemes would be even simpler than words, but another thread I read made it seem like it would be more difficult. If there is any library for parsing phonemes, I'd like to know about it, but if this sort of technology doesn't exist yet, it would be interesting to know why words are easier than phonemes. |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I was thinking that this might be a good tool to automatically create lip sync from audio tracks. For output, though, instead of English words I'd need a list of basic phonemes and the time they occurred measured in milliseconds. Is Whisper able to do this?
Beta Was this translation helpful? Give feedback.
All reactions