已支持CosyVoice2、SenseVoice、InspireMusic

新增内容：

初步支持了inspiremusic，还没有经过严格测试（推理时有问题可以尝试换模型）。相应的新增了依赖，同时需要安装flash-attention（不使用InspireMusic可以不装），windows系统的whl可以从这里下载：

https://huggingface.co/lldacing/flash-attention-windows-wheel/tree/main
https://github.com/bdashore3/flash-attention/releases

新增了是否自动下载模型的选项
新增了多音字替换功能，配置在funaudio_utils/多音字纠正配置.txt。感谢https://github.com/touge/ComfyUI-NCE_CosyVoice/tree/main
新增了3个CosyVoice2节点。
整理了节点组。
从官方更新了CosyVoice、SenseVoice、match。
补充了更新CosyVoice后新增的参数text_frontend，作用应该是规范化文本，默认为True。
优化了Speaker模型的保存与加载。
因为CosyVoice2需要，采样率22050几乎全部改为了24000。

使用说明：

工作流详见示例workflow
建议自动下载模型，不熟悉的话容易重复下载（下载好之后强烈建议关闭自动下载）。
手动下载请参考官方CosyVoice、SenseVoice、InspireMusic。手动下载后需要将inspiremusic.yaml文件里的'../../pretrained_models'替换成自己的模型路径。
Speaker模型默认存储在 /models/CosyVoice/Speaker
当以Speaker模型做为输入时，保存模型依然生效，但是保存的模型应该没有数据。

安装注意事项：

Windows系统需要使用conda虚拟环境。
试验下来python3.12也能用，推荐使用python3.10，torch<=2.4.1
pynini2.1.6轮子下载
pynini2.1.6可能会有问题（可能需要更高的python版本，比如3.12），2.1.5安装方法：

conda install -y -c conda-forge pynini==2.1.5 
python -m pip install WeTextProcessing --no-deps
python -m pip install -r requirements.txt

如果报错缺模块就自行安装。
安装ffmpeg，并将ffmpeg.exe所在文件夹添加到环境变量。

conda虚拟环境使用方式

在ComfyUI同级目录（与官方批处理同文件夹）创建批处理文件，内容如下：

@echo off

:: 切换到 ComfyUI 目录
cd ComfyUI

:: 激活你的 Conda 虚拟环境
call conda activate your-env

:: 运行 Python 脚本
python -s main.py --windows-standalone-build --fast

pause

或者直接覆盖官方批处理的内容。更多conda使用请自行学习。

以下是原项目说明：

ComfyUI-FunAudioLLM

Comfyui custom node for FunAudioLLM include CosyVoice and SenseVoice

Features

CosyVoice

CosyVoice Version: 2024-10-04
Support SFT,Zero-shot,Cross-lingual,Instruct
Support CosyVoice-300M-25Hz in zero-shot and cross-lingual
Support SFT's 25Hz(unoffical)
Save and load speaker model in zero-shot

SenseVoice

SenseVoice Version: 2024-10-04
Support SenseVoice-Small
Support Punctuation segment (need turn off use_fast_mode)

How use

apt update
apt install ffmpeg

## in ComfyUI/custom_nodes
git clone https://github.com/SpenserCai/ComfyUI-FunAudioLLM
cd ComfyUI-FunAudioLLM
pip install -r requirements.txt

Windows

In windows need use conda to install pynini

conda install -c conda-forge pynini=2.1.6
pip install -r requirements.txt

MacOS

If meet error when you install

brew install openfst
export CPPFLAGS="-I/opt/homebrew/include"
export LDFLAGS="-L/opt/homebrew/lib"
pip install -r requirements.txt

If your network is unstable, you can pre-download the model from the following sources and place it in the appropriate directory.

CosyVoice-300M -> ComfyUI/models/CosyVoice/CosyVoice-300M
CosyVoice-300M-25Hz -> ComfyUI/models/CosyVoice/CosyVoice-300M-25Hz
CosyVoice-300M-SFT -> ComfyUI/models/CosyVoice/CosyVoice-300M-SFT
CosyVoice-300M-SFT-25Hz -> ComfyUI/models/CosyVoice/CosyVoice-300M-SFT-25Hz
CosyVoice-300M-Instruct -> ComfyUI/models/CosyVoice/CosyVoice-300M-Instruct
SenseVoiceSmall -> ComfyUI/models/SenseVoice/SenseVoiceSmall

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
.github/workflows		.github/workflows
assets		assets
cosyvoice		cosyvoice
funaudio_utils		funaudio_utils
inspiremusic		inspiremusic
matcha		matcha
nodes		nodes
sensevoice		sensevoice
web		web
workflow		workflow
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

已支持CosyVoice2、SenseVoice、InspireMusic

新增内容：

使用说明：

安装注意事项：

conda虚拟环境使用方式

以下是原项目说明：

ComfyUI-FunAudioLLM

Features

CosyVoice

SenseVoice

How use

Windows

MacOS

WorkFlow

About

Uh oh!

Releases

Packages

Languages

zhilengjun/ComfyUI-FunAudioLLM_V2

Folders and files

Latest commit

History

Repository files navigation

已支持CosyVoice2、SenseVoice、InspireMusic

新增内容：

使用说明：

安装注意事项：

conda虚拟环境使用方式

以下是原项目说明：

ComfyUI-FunAudioLLM

Features

CosyVoice

SenseVoice

How use

Windows

MacOS

WorkFlow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages