The
rtst is a Python package for comprehensive speech processing. It integrates speech recognition, language detection, translation, and text-to-speech functionalities into a single, easy-to-use function.
- Unified function for speech recognition, language detection, translation, and speaking the translation.
- Support for multiple speech recognition services: Google, Azure, AWS, and Hugging Face.
- Automatic translation of recognized speech to English.
- Vocalization of translated text.
Install the package using pip:
pip install rtstHere’s how to use rtst to recognize, translate, and vocalize speech:
The package provides a rtst function, which takes care of the entire speech processing workflow:
import rtst
# Call the function with your preferred service and language code
rtst.process_speech()This function will:
- Capture speech from the microphone.
- Recognize the speech using the specified service.
- Detect the language of the recognized speech.
- Translate the speech to English if it's in a different language.
- Vocalize the translated text.
For more advanced usage, refer to the package documentation. You can customize various aspects such as selecting different speech recognition services or handling different languages.
MIT License
Contributions to rtst are welcome. Please follow the guidelines provided in the repository for contributing.