start of a port of kyutai's moshi text to speech #1369
Codes4Fun
started this conversation in
Show and tell
Replies: 1 comment
-
|
I've done a binary release for those interested in demoing speech-to-speech with a hallucinating ai, minimum requirements 8GB of VRAM and RTX 2070: |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
https://github.com/Codes4Fun/moshi.cpp
In it's current state, doing tts, I am getting about 3 seconds of audio from 1 second of generation time on an RTX 4090, and I think there is a lot of room to improve on that.
Beta Was this translation helpful? Give feedback.
All reactions