Video file:
https://github.com/bytedance/LatentSync/blob/main/assets/demo1_video.mp4
Audio file:
https://github.com/bytedance/LatentSync/blob/main/assets/demo1_audio.wav
Video file
This model requires additional module.
pip3 install face_alignment
pip3 install librosa
Install ffmepg package.
apt-get install ffmpeg
Automatically downloads the onnx and prototxt files on the first run. It is necessary to be connected to the Internet while downloading.
For the sample video, audio,
$ python3 latentsync.pyIf you want to specify the video and audio files, put the file paths after the --video and --input options.
You can use the --savepath option to change the name of the output file to save.
$ python3 latentsync.py --input AUDIO_FILE --video VIDEO_FILE --savepath OUTPUT_FILEPytorch
ONNX opset=17
unet.onnx.prototxt
vae_encoder.onnx.prototxt
vae_decoder.onnx.prototxt
whisper_tiny.onnx.prototxt