Speech Emotion Recognition using Semantic Information

This repository provides training and evaluation code for the paper Speech Emotion Recognition using Semantic Information (ICASSP 2021). If you use this codebase in your experiments please cite:

Tzirakis, P., Nguyen, A., Zafeiriou, S., & Schuller, B. W. (2021). Speech Emotion Recognition using Semantic Information. arXiv preprint arXiv:2103.02993.

This repository provides the following:

Word2Vec embeddings trained on the German SWC corpus.
Speech2Vec embeddings trained on the SEWA DB.
Code to train/evaluate word2vec and our unified model, i. e., paralinguistic and semantic feature extrators with a LSTM cell on top.

Requirements

Below are listed the required modules to run the code.

aeneas
librosa
nltk
numpy
stop-words
tensorflow
torch

Steps

Create the speech2vec segmentation by running speech2word_mapping.py in speech2vec folder.
Run data_generator.py to create tfrecords.
Run train.py to train the models, and eval.py to evaluate.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speech Emotion Recognition using Semantic Information

Requirements

Steps

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Speech Emotion Recognition using Semantic Information

Requirements

Steps