From c1d59188dee90bd18ee4a2c312f431f00a9e15a2 Mon Sep 17 00:00:00 2001 From: Jax Leo <42597871+weaponmasterjax@users.noreply.github.com> Date: Tue, 28 Jan 2025 09:34:14 +0000 Subject: [PATCH 1/3] README: update ROCM version --- docs/fr/README.fr.md | 2 +- docs/pt/README.pt.md | 2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git a/docs/fr/README.fr.md b/docs/fr/README.fr.md index 18699232..9479a372 100644 --- a/docs/fr/README.fr.md +++ b/docs/fr/README.fr.md @@ -56,7 +56,7 @@ pip install torch torchvision torchaudio pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117 # Pour Linux + carte AMD, utilisez cette version de Pytorch: -pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm5.4.2 +pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.2 ``` Vous pouvez utiliser poetry pour installer les dépendances : diff --git a/docs/pt/README.pt.md b/docs/pt/README.pt.md index 4d66a5e3..d416c516 100644 --- a/docs/pt/README.pt.md +++ b/docs/pt/README.pt.md @@ -62,7 +62,7 @@ pip install torch torchvision torchaudio #pip instalar tocha torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117 #Para placas Linux + AMD, você precisa usar as seguintes versões do pytorch: -#pip instalar tocha torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm5.4.2 +#pip instalar tocha torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.2 ``` Então pode usar poesia para instalar as outras dependências: From cdc8696aed5c4d740e0586d44f17b0f6b60354f2 Mon Sep 17 00:00:00 2001 From: Jax Leo <42597871+weaponmasterjax@users.noreply.github.com> Date: Tue, 28 Jan 2025 06:27:52 +0000 Subject: [PATCH 2/3] README: add instructions for installing latest ROCM version of PyTorch --- docs/en/README.en.md | 221 +++++++++++++++++++++++++++++++++++++++++++ docs/fr/README.fr.md | 4 + docs/kr/README.ko.md | 6 ++ docs/pt/README.pt.md | 4 + 4 files changed, 235 insertions(+) create mode 100644 docs/en/README.en.md diff --git a/docs/en/README.en.md b/docs/en/README.en.md new file mode 100644 index 00000000..35d3df8c --- /dev/null +++ b/docs/en/README.en.md @@ -0,0 +1,221 @@ +
+ +

Retrieval-based-Voice-Conversion-WebUI

+An easy-to-use Voice Conversion framework based on VITS.

+ +[![madewithlove](https://img.shields.io/badge/made_with-%E2%9D%A4-red?style=for-the-badge&labelColor=orange +)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI) + +
+ +[![Open In Colab](https://img.shields.io/badge/Colab-F9AB00?style=for-the-badge&logo=googlecolab&color=525252)](https://colab.research.google.com/github/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/Retrieval_based_Voice_Conversion_WebUI.ipynb) +[![Licence](https://img.shields.io/github/license/RVC-Project/Retrieval-based-Voice-Conversion-WebUI?style=for-the-badge)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/LICENSE) +[![Huggingface](https://img.shields.io/badge/🤗%20-Spaces-yellow.svg?style=for-the-badge)](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/) + +[![Discord](https://img.shields.io/badge/RVC%20Developers-Discord-7289DA?style=for-the-badge&logo=discord&logoColor=white)](https://discord.gg/HcsmBBGyVk) + +[**Changelog**](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/docs/Changelog_EN.md) | [**FAQ (Frequently Asked Questions)**](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/wiki/FAQ-(Frequently-Asked-Questions)) + +[**English**](../en/README.en.md) | [**中文简体**](../../README.md) | [**日本語**](../jp/README.ja.md) | [**한국어**](../kr/README.ko.md) ([**韓國語**](../kr/README.ko.han.md)) | [**Français**](../fr/README.fr.md) | [**Türkçe**](../tr/README.tr.md) | [**Português**](../pt/README.pt.md) + +
+ +> Check out our [Demo Video](https://www.bilibili.com/video/BV1pm4y1z7Gm/) here! + + + + + + + + + + + + + + + + + + +
Training and inference WebuiReal-time voice changing GUI
go-web.batgo-realtime-gui.bat
You can freely choose the action you want to perform.We have achieved an end-to-end latency of 170ms. With the use of ASIO input and output devices, we have managed to achieve an end-to-end latency of 90ms, but it is highly dependent on hardware driver support.
+ +> The dataset for the pre-training model uses nearly 50 hours of high quality audio from the VCTK open source dataset. + +> High quality licensed song datasets will be added to the training-set often for your use, without having to worry about copyright infringement. + +> Please look forward to the pretrained base model of RVCv3, which has larger parameters, more training data, better results, unchanged inference speed, and requires less training data for training. + +## Features: ++ Reduce tone leakage by replacing the source feature to training-set feature using top1 retrieval; ++ Easy + fast training, even on poor graphics cards; ++ Training with a small amounts of data (>=10min low noise speech recommended); ++ Model fusion to change timbres (using ckpt processing tab->ckpt merge); ++ Easy-to-use WebUI; ++ UVR5 model to quickly separate vocals and instruments; ++ High-pitch Voice Extraction Algorithm [InterSpeech2023-RMVPE](#Credits) to prevent a muted sound problem. Provides the best results (significantly) and is faster with lower resource consumption than Crepe_full; ++ AMD/Intel graphics cards acceleration supported; ++ Intel ARC graphics cards acceleration with IPEX supported. + +## Preparing the environment +The following commands need to be executed with Python 3.8 or higher. + +(Windows/Linux) +First install the main dependencies through pip: +```bash +# Install PyTorch-related core dependencies, skip if installed +# Reference: https://pytorch.org/get-started/locally/ +pip install torch torchvision torchaudio + +#For Windows + Nvidia Ampere Architecture(RTX30xx), you need to specify the cuda version corresponding to pytorch according to the experience of https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/issues/21 +#pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117 + +#For Linux + AMD Cards, you need to use the following pytorch versions: +#pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.2 +``` + +Then can use poetry to install the other dependencies: +```bash +# Install the Poetry dependency management tool, skip if installed +# Reference: https://python-poetry.org/docs/#installation +curl -sSL https://install.python-poetry.org | python3 - + +# Install the project dependencies +poetry install +``` + +You can also use pip to install them: +```bash + +for Nvidia graphics cards + pip install -r requirements.txt + +for AMD/Intel graphics cards on Windows (DirectML): + pip install -r requirements-dml.txt + +for Intel ARC graphics cards on Linux / WSL using Python 3.10: + pip install -r requirements-ipex.txt + +for AMD graphics cards on Linux (ROCm): + pip install -r requirements-amd.txt +``` + +------ +Mac users can install dependencies via `run.sh`: +```bash +sh ./run.sh +``` + +## Preparation of other Pre-models +RVC requires other pre-models to infer and train. + +```bash +#Download all needed models from https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/ +python tools/download_models.py +``` + +Or just download them by yourself from our [Huggingface space](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/). + +Here's a list of Pre-models and other files that RVC needs: +```bash +./assets/hubert/hubert_base.pt + +./assets/pretrained + +./assets/uvr5_weights + +Additional downloads are required if you want to test the v2 version of the model. + +./assets/pretrained_v2 + +If you want to test the v2 version model (the v2 version model has changed the input from the 256 dimensional feature of 9-layer Hubert+final_proj to the 768 dimensional feature of 12-layer Hubert, and has added 3 period discriminators), you will need to download additional features + +./assets/pretrained_v2 + +If you want to use the latest SOTA RMVPE vocal pitch extraction algorithm, you need to download the RMVPE weights and place them in the RVC root directory + +https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.pt + + For AMD/Intel graphics cards users you need download: + + https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.onnx + +``` + +### 2. Install FFmpeg +If you have FFmpeg and FFprobe installed on your computer, you can skip this step. + +#### For Ubuntu/Debian users +```bash +sudo apt install ffmpeg +``` +#### For MacOS users +```bash +brew install ffmpeg +``` +#### For Windows users +Download these files and place them in the root folder: +- [ffmpeg.exe](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/ffmpeg.exe) + +- [ffprobe.exe](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/ffprobe.exe) + +## ROCm Support for AMD graphic cards (Linux only) +To use ROCm on Linux install all required drivers as described [here](https://rocm.docs.amd.com/en/latest/deploy/linux/os-native/install.html). + +On Arch use pacman to install the driver: +```` +pacman -S rocm-hip-sdk rocm-opencl-sdk +```` + +You might also need to set these environment variables (e.g. on a RX6700XT): +```` +export ROCM_PATH=/opt/rocm #Set ROCM Executables Path +export HSA_OVERRIDE_GFX_VERSION=10.3.0 #Spoof GPU Model for ROCM +```` + +And overwrite PyTorch with its ROCM version after installing dependencies. +```` +pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.2 +```` + +Make sure your user is part of the `render` and `video` group: +```` +sudo usermod -aG render $USERNAME +sudo usermod -aG video $USERNAME +```` + +## Get started +### start up directly +Use the following command to start WebUI: +```bash +python infer-web.py +``` +### Use the integration package +Download and extract file `RVC-beta.7z`, then follow the steps below according to your system: +#### For Windows users +Double click `go-web.bat` +#### For MacOS users +```bash +sh ./run.sh +``` +### For Intel IPEX users (Linux Only) +```bash +source /opt/intel/oneapi/setvars.sh +``` +## Credits ++ [ContentVec](https://github.com/auspicious3000/contentvec/) ++ [VITS](https://github.com/jaywalnut310/vits) ++ [HIFIGAN](https://github.com/jik876/hifi-gan) ++ [Gradio](https://github.com/gradio-app/gradio) ++ [FFmpeg](https://github.com/FFmpeg/FFmpeg) ++ [Ultimate Vocal Remover](https://github.com/Anjok07/ultimatevocalremovergui) ++ [audio-slicer](https://github.com/openvpi/audio-slicer) ++ [Vocal pitch extraction:RMVPE](https://github.com/Dream-High/RMVPE) + + The pretrained model is trained and tested by [yxlllc](https://github.com/yxlllc/RMVPE) and [RVC-Boss](https://github.com/RVC-Boss). + +## Thanks to all contributors for their efforts + + + + diff --git a/docs/fr/README.fr.md b/docs/fr/README.fr.md index 9479a372..18c614b7 100644 --- a/docs/fr/README.fr.md +++ b/docs/fr/README.fr.md @@ -142,6 +142,10 @@ Vous devrez peut-être créer ces variables d'environnement (par exemple avec RX export ROCM_PATH=/opt/rocm export HSA_OVERRIDE_GFX_VERSION=10.3.0 ```` +Et remplacer PyTorch par sa version ROCM après l'installation des dépendances. +```` +pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.2 +```` Assurez-vous que votre utilisateur est dans les groupes `render` et `video`: ```` sudo usermod -aG render $USERNAME diff --git a/docs/kr/README.ko.md b/docs/kr/README.ko.md index 00bc8072..1a9df117 100644 --- a/docs/kr/README.ko.md +++ b/docs/kr/README.ko.md @@ -306,6 +306,12 @@ export ROCM_PATH=/opt/rocm export HSA_OVERRIDE_GFX_VERSION=10.3.0 ``` +그리고 종속 요소를 설치한 후 PyTorch를 ROCM 버전으로 덮어씁니다. + +```` +pip 설치 토치 토치비전 토치오디오 --index-url https://download.pytorch.org/whl/rocm6.2 +```` + 동시에 현재 사용자가 `render` 및 `video` 사용자 그룹에 속해 있는지 확인하세요. ``` diff --git a/docs/pt/README.pt.md b/docs/pt/README.pt.md index d416c516..e6e62379 100644 --- a/docs/pt/README.pt.md +++ b/docs/pt/README.pt.md @@ -155,6 +155,10 @@ Talvez você também precise definir estas variáveis de ambiente (por exemplo, export ROCM_PATH=/opt/rocm export HSA_OVERRIDE_GFX_VERSION=10.3.0 ```` +E substitui o PyTorch pela sua versão ROCM depois de instalar as dependências.a +```` +pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.2 +```` Verifique também se seu usuário faz parte do grupo `render` e `video`: ```` sudo usermod -aG render $USERNAME From 2e26611f086d288b7b811eccf822974e98769c48 Mon Sep 17 00:00:00 2001 From: Jax Leo <42597871+weaponmasterjax@users.noreply.github.com> Date: Tue, 28 Jan 2025 06:38:20 +0000 Subject: [PATCH 3/3] README: add hint for installing PyTorch with ROCM on Linux --- README.md | 10 ++++++++-- 1 file changed, 8 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 9d9c175e..f4010e56 100644 --- a/README.md +++ b/README.md @@ -73,6 +73,7 @@ sh ./run.sh ```bash pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117 ``` + 3. Install the corresponding dependencies according to your own graphics card. - Nvidia GPU ```bash @@ -91,6 +92,11 @@ sh ./run.sh pip install -r requirements/ipex.txt ``` +4.If you are using an ROCM-capable AMD Radeon GPU, then you need to choose ROCM version of PyTorch. + ```bash + pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.2 + ``` + ## Preparation of Other Files ### 1. Assets > RVC requires some models located in the `assets` folder for inference and training. @@ -156,8 +162,8 @@ pacman -S rocm-hip-sdk rocm-opencl-sdk ```` For some models of graphics cards, you may need to configure the following environment variables (such as: RX6700XT). ```` -export ROCM_PATH=/opt/rocm -export HSA_OVERRIDE_GFX_VERSION=10.3.0 +export ROCM_PATH=/opt/rocm #Set ROCM Executables Path +export HSA_OVERRIDE_GFX_VERSION=10.3.0 #Spoof GPU Model for ROCM ```` Also, make sure your current user is in the `render` and `video` user groups. ````