TrueVoxAI is a Speech Emotion Recognition (SER) project that analyzes voice recordings to detect emotions such as Happy, Sad, Angry, Neutral, and Fear.
It combines machine learning, audio signal processing, and web deployment in a single end-to-end project.
- 🎤 Record or Upload Speech through the web interface
- 🧠 ML Model (MLP Classifier) trained on speech datasets (RAVDESS/CREMA-D)
- 🎼 Feature Extraction using MFCC, Chroma, Spectral Contrast
- 🌐 Flask-based Web App with a simple UI
- ☁️ Deployment-ready with Vercel + Hugging Face Spaces
- Python (librosa, scikit-learn, numpy, pandas, matplotlib)
- Flask (backend + UI rendering)
- MLPClassifier from scikit-learn for classification
- Vercel / Hugging Face Spaces for deployment
- app.py – Flask app
- requirements.txt – Python dependencies
- vercel.json – Vercel deployment config
- models/ – Trained ML artifacts
  - mlp_emotion_model.pkl – Trained model
  - scaler.pkl – Feature normalizer
  - label_encoder.pkl – Label mapping
- templates/ – HTML UI
- static/ – CSS/JS assets
- Notebooks/TrueVox.ipynb – Training notebook
- User uploads/records speech 🎙️
- Audio features extracted using librosa 🎼
- Features normalized using scaler.pkl 📊
- Model predicts emotion using mlp_emotion_model.pkl 🧠
- Result displayed on UI 🎉
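
The steps above can be sketched in Python roughly as follows. This is an illustrative outline, not the code in app.py: the function names `load_artifacts`, `extract_features`, and `predict_emotion` are hypothetical, and so are `n_mfcc=40` and the choice to average each feature over time. It also assumes the `.pkl` files were written with `pickle`; if the training notebook saved them with joblib, `joblib.load` would be needed instead.

```python
import pickle
from pathlib import Path


def load_artifacts(model_dir="models"):
    """Load the scaler, model, and label encoder from the models/ folder."""
    # Assumption: the artifacts were saved with pickle (not joblib).
    names = ("scaler.pkl", "mlp_emotion_model.pkl", "label_encoder.pkl")
    artifacts = []
    for name in names:
        with open(Path(model_dir) / name, "rb") as fh:
            artifacts.append(pickle.load(fh))
    return tuple(artifacts)


def extract_features(audio_path, n_mfcc=40):
    """Return one feature vector: time-averaged MFCC, chroma, spectral contrast."""
    # Imported here so the other helpers load without the audio stack installed.
    import librosa
    import numpy as np

    # librosa.load resamples to 22050 Hz by default, keeping features comparable.
    signal, sample_rate = librosa.load(audio_path)
    mfcc = librosa.feature.mfcc(y=signal, sr=sample_rate, n_mfcc=n_mfcc)
    chroma = librosa.feature.chroma_stft(y=signal, sr=sample_rate)
    contrast = librosa.feature.spectral_contrast(y=signal, sr=sample_rate)
    # Each feature has shape (n_coefficients, n_frames); average over frames.
    return np.concatenate(
        [mfcc.mean(axis=1), chroma.mean(axis=1), contrast.mean(axis=1)]
    )


def predict_emotion(features, scaler, model, label_encoder):
    """Normalize one feature vector, classify it, and decode the label."""
    scaled = scaler.transform([features])      # 2-D input: one sample
    encoded = model.predict(scaled)            # encoded class index
    return str(label_encoder.inverse_transform(encoded)[0])
```

A call would look like `predict_emotion(extract_features("sample.wav"), *load_artifacts())`, where `sample.wav` is a placeholder file name. The feature vector must have the same length and ordering as the one used during training, so the extraction settings here would need to match the notebook.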
# clone repo
git clone https://github.com/tannuiscoding/TrueVoxAI.git
cd TrueVoxAI
# install dependencies
pip install -r requirements.txt
# run Flask app
python app.py
This project was built with ❤️ by:
- Mansi
- Tannu Choudhary