Two audio files.
Example
input1: example/00001_spk1.wav
input2: example/00024_spk1.wav
(Wav file from https://huggingface.co/spaces/wenet/wespeaker_demo)
Degree of similarity.
The speakers are 84.8% similar
Welcome, human!
This model recommends additional module.
pip3 install torch torchaudio
Automatically downloads the onnx and prototxt files on the first run. It is necessary to be connected to the Internet while downloading.
For the sample audio,
$ python3 wespeaker.py --input1 examples/00001_spk1.wav --input2 examples/00024_spk1.wavSpecify two audio files with the --input1 and --input2 options.
By specifying the --english option, it uses the English model
and the --chinese option for the Chinese.
$ python3 wespeaker.py --input1 AUDIO_FILE1 --input2 AUDIO_FILE2 --englishOnnxruntime
ONNX opset=14
voxceleb_resnet34.onnx.prototxt
cnceleb_resnet34.onnx.prototxt