🔥🔥🔥 A Survey on Uni-modal and Multi-modal Models
Project Page [This Page] | Paper | ✒️ Citation | 💬 WeChat (Emo微信交流群,欢迎加入)
This is the first work to comprehensive review of recent advancements in both uni-modal and multi-modal emotion recognition systems. ✨
🔥🔥🔥 EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models
You can experience our Basic Demo on ModelScope directly. The Real-Time Interactive Demo needs to be configured according to the instructions.
A representative evaluation benchmark for Emo. ✨
🔥🔥🔥 MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition
Paper | GitHub
🔥🔥🔥 emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Paper | GitHub
🔥🔥🔥 Uncertain Multimodal Intention and Emotion Understanding in the Wild
Paper | GitHub
🔥🔥🔥 MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark
Paper | GitHub
🔥🔥🔥 Belief Mismatch Coefficient (BMC): A Novel Interpretable Measure of Prediction Accuracy for Ambiguous Emotion States
Paper (ACII 2023 Best Paper)
🔥🔥🔥 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Paper (1st place Odyssey 2024 Emotion Recognition Challenge)
🔥🔥🔥 Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective
Paper | GitHub
🔥🔥🔥 HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
Paper | GitHub
🔥🔥🔥 Spectral Representation of Behaviour Primitives for Depression Analysis
Paper | GitHub (IEEE Transactions on Affective Computing, BEST PAPER RUNNER UP)
🔥🔥🔥 Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning
Paper
🔥🔥🔥 A Scoping Review of Large Language Models for Generative Tasks in Mental Health Care
Paper (NPJ| Digital Medicine)
This is the first work to comprehensive review of recent advancements in both uni-modal and multi-modal emotion recognition systems. ✨
Table of Contents
| Title | Venue | Date | Code | Demo |
|---|---|---|---|---|
Facial Emotion Recognition using CNN |
arXiv | 2023-09-11 | Github | - |
| Title | Venue | Date | Code | Demo |
|---|---|---|---|---|
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation |
arXiv | 2023-06-10 | Github | - |
| Title | Venue | Date | Code | Demo |
|---|---|---|---|---|
Emotion Detection in Text using Natural Language Processing |
arXiv | 2025-08-15 | Github |
| Title | Venue | Date | Code | Demo |
|---|---|---|---|---|
Integrating Multimodal Information in Large Pretrained Transformers |
ACL | 2020-05-10 | GitHub | |
Tensor Fusion Network for Multimodal Sentiment Analysis |
EMNLP | 2017-09-01 | GitHub |
| Title | Venue | Date | Code | Demo |
|---|---|---|---|---|
MELD Dataset |
ACL | 2019 | GitHub | |
CREMA-D Dataset |
ICMI | 2014 | GitHub |

