Skip to content

stavsap/comfyui-kokoro

Repository files navigation

Comfy UI Kokoro

Buy Me A Coffee

Kokoro TTS nodes, wrapping this kokoro onnx that is based on hexgrad/Kokoro-82M.

workflow.png

note: This picture is also a workflow, just download and drop it into comfy.

Install

Install Via ComfyUI Manager, by stavsap.

img.png

Or

Clone the repo into custom_nodes folder

git clone https://github.com/stavsap/comfyui-kokoro.git

Then cd into comfyui-kokoro, and install requirements.

pip install -r requirements.txt 

And finally reboot Comfy.

The onnx model and speakers meta-data will be automatically downloaded on the first run.

If using windows portable version and experience issues with dependencies, check the following:

IMAGE ALT TEXT HERE

Nodes

Currently, there are 3 nodes that can be combined for TTS workflow.

Kokoro Speaker

speaker.png

Select supported speakers.

Kokoro Speaker Combiner

speaker_combiner.png

Combiner node to combine 2 given speakers to new speaker.

  • weight: [1, 0], select the weight of speaker a.

Example:

weight == 0.7 will result in strength of 70% of speaker_a and 30% of speaker_b.

Kokoro Generate

generator.png

  • speaker: input a speaker
  • speed: set the speach speed.
  • lang: set the language, what ever is supported by kokoro.

Available Voices

All supported voices can be found here.

Use Cases:

  1. TTS: Text To Speach, generate voice from test.

  2. Lip Sync: sync lips of videos

lipsync.png

License

  • This repo
  • kokoro-onnx: MIT
  • kokoro model: Apache 2.0

Credits

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors 2

  •  
  •  

Languages