tranquangchung/Audio_Speech_Conversation

Spoken Dialogue: Speech-to-Speech Conversation

This repository contains the implementation of a Speech-to-Speech (S2S) Dialogue System.


📥 Model Checkpoints

The pre-trained and fine-tuned checkpoints for this dialogue system are available on Hugging Face:

Download Link: tranquangchung/qwen2-audio-dialogue

You can clone the model using:

git lfs install
git clone https://huggingface.co/tranquangchung/qwen2-audio-dialogue
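If Git LFS is not available, the same files can usually be fetched from Python instead. The following is a minimal sketch, assuming the `huggingface_hub` package is installed (it is not in the prerequisites list of this README); the function name `download_checkpoint` and the default target directory are illustrative choices, not part of this repository.

```python
from pathlib import Path

# Repository id taken from the download link above.
REPO_ID = "tranquangchung/qwen2-audio-dialogue"


def download_checkpoint(local_dir="qwen2-audio-dialogue"):
    """Download every file of the checkpoint repo into local_dir.

    Hypothetical helper: assumes `pip install huggingface_hub` has been run.
    """
    # Imported lazily so this module loads even without huggingface_hub.
    from huggingface_hub import snapshot_download

    path = snapshot_download(repo_id=REPO_ID, local_dir=str(Path(local_dir)))
    return Path(path)
```

Calling `download_checkpoint()` should return the local folder holding the checkpoint files; whether the inference script expects that exact folder name is not stated here, so adjust the path as needed.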

🚀 Getting Started

  1. Prerequisites: Ensure you have Python 3.10+ and the necessary audio processing libraries installed:

     pip install torch torchaudio transformers accelerate librosa

  2. To test the dialogue system with real-world ("in-the-wild") audio samples, run the provided inference script:

     python test_dialogue_inthewild.py
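Before running the inference script, it can help to confirm that the prerequisites from step 1 are actually in place. The check below is a small sketch that uses only the standard library; the helper names `python_ok` and `missing_packages` are illustrative and are not part of this repository.

```python
import importlib.util
import sys

# Packages from the pip install line in step 1.
REQUIRED_PACKAGES = ["torch", "torchaudio", "transformers", "accelerate", "librosa"]
MIN_PYTHON = (3, 10)


def python_ok(version_info=None, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= minimum


def missing_packages(names=REQUIRED_PACKAGES):
    """Return the names that cannot be located as importable top-level modules."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    if not python_ok():
        print("Python 3.10+ is required, found", sys.version.split()[0])
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed packages were found.")
```

This only checks that the packages can be found, not that their versions are compatible with the checkpoint.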
