The goal of this project is to recognise emotions in as wide a range of situations as possible. To do so, four kinds of data are used: face, posture, body, and context/environment features. Each of these is processed independently and then combined with a fusion method called EmbraceNet+, which is an extension of EmbraceNet.
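For intuition, here is a minimal sketch of the EmbraceNet-style fusion idea (not the code in this repository): each modality is first "docked" to a common embedding size, and every embedding dimension is then filled from exactly one modality, chosen by multinomial sampling. The class name, embedding size, and shapes below are illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class EmbraceFusion(nn.Module):
    """Minimal sketch of EmbraceNet-style fusion (illustrative, not the repository's code)."""

    def __init__(self, input_sizes, embed_size=256):
        super().__init__()
        # One docking layer per modality, mapping each input to the shared embedding size.
        self.docking = nn.ModuleList(nn.Linear(s, embed_size) for s in input_sizes)
        self.embed_size = embed_size

    def forward(self, inputs, probs=None):
        # inputs: list of tensors, one per modality, each of shape [batch, input_size_k]
        docked = torch.stack(
            [torch.relu(dock(x)) for dock, x in zip(self.docking, inputs)], dim=1
        )  # [batch, n_modalities, embed_size]
        batch, n_mod, _ = docked.shape
        if probs is None:
            probs = torch.full((batch, n_mod), 1.0 / n_mod, device=docked.device)
        # For every embedding index, pick the modality that provides that feature.
        choice = torch.multinomial(probs, self.embed_size, replacement=True)  # [batch, embed_size]
        mask = F.one_hot(choice, n_mod).permute(0, 2, 1).float()              # [batch, n_mod, embed_size]
        return (docked * mask).sum(dim=1)                                     # [batch, embed_size]

# Toy usage with made-up feature sizes for the four modalities.
fusion = EmbraceFusion(input_sizes=[512, 512, 256, 1024], embed_size=256)
features = [torch.randn(4, s) for s in [512, 512, 256, 1024]]
fused = fusion(features)  # shape [4, 256]
```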
This repository includes the full implementation of the multi-modal method in a Google Colab environment, together with its training and the tests performed.
Disclaimer: Due to lack of resources and time, the code could not be tested locally. The available notebooks may help when debugging errors.
- pytorch >= 1.6
- CUDA 10.1
- mxnet-cu101
- insightface
- numpy
- opencv-python (cv2)
- tqdm
- pandas
- scikit-learn (sklearn)
The data used are shared here; the zip file contains all the data for each modality in NumPy array format.
This dataset is derived from the original EMOTIC dataset.
Moreover, the 26 emotion categories annotated in EMOTIC were reduced to eight groups by grouping them according to Plutchik's taxonomy.
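As an illustration of that reduction, the sketch below collapses EMOTIC category names into Plutchik's eight basic emotions. The category-to-group assignment shown here is partial and hypothetical; the grouping actually used by the project may differ.

```python
import numpy as np

# Plutchik's eight basic emotions.
GROUPS = ["joy", "trust", "fear", "surprise", "sadness", "disgust", "anger", "anticipation"]

# Partial, hypothetical assignment of EMOTIC categories to Plutchik groups.
CATEGORY_TO_GROUP = {
    "Happiness": "joy",
    "Pleasure": "joy",
    "Esteem": "trust",
    "Fear": "fear",
    "Surprise": "surprise",
    "Sadness": "sadness",
    "Suffering": "sadness",
    "Aversion": "disgust",
    "Anger": "anger",
    "Annoyance": "anger",
    "Anticipation": "anticipation",
}

def group_annotation(emotic_labels):
    """Collapse a list of EMOTIC category names into an 8-dim multi-hot group vector."""
    vec = np.zeros(len(GROUPS), dtype=np.float32)
    for name in emotic_labels:
        if name in CATEGORY_TO_GROUP:
            vec[GROUPS.index(CATEGORY_TO_GROUP[name])] = 1.0
    return vec

print(group_annotation(["Happiness", "Anticipation"]))  # -> joy and anticipation set to 1
```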
PyTorch's weighted random sampler was used at training time to mitigate the class imbalance of the EMOTIC dataset.
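A minimal sketch of that setup follows; the dataset tensors, sizes, and batch size are placeholders rather than the repository's actual loaders.

```python
import numpy as np
import torch
from torch.utils.data import DataLoader, TensorDataset, WeightedRandomSampler

# Illustrative stand-in for the real EMOTIC tensors; shapes and names are not from the repo.
features = torch.randn(1000, 26)
labels = torch.randint(0, 8, (1000,))  # one of the eight grouped emotions per sample
dataset = TensorDataset(features, labels)

# Weight each sample by the inverse frequency of its class so rare emotions are drawn more often.
class_counts = np.bincount(labels.numpy(), minlength=8)
sample_weights = 1.0 / class_counts[labels.numpy()]
sampler = WeightedRandomSampler(
    weights=torch.as_tensor(sample_weights, dtype=torch.double),
    num_samples=len(sample_weights),
    replacement=True,
)
loader = DataLoader(dataset, batch_size=32, sampler=sampler)
```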
- The first notebook, EmbraeNet_Plus.ipynb, contains the EMOTIC processing, the preparation of each modality's input, and the training and testing of the four independent modalities and of the multi-modal method.
- The second notebook, Demo_n_other_evals.ipynb, contains the procedures to run the inferences for the test results and to use the full method end to end, from an input image to the output, including the measurement of execution time (a rough timing sketch follows this list).
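As a rough illustration of the timing step in the demo notebook, the helper below wraps a forward pass with a timer. The `model` handle and `modal_inputs` are placeholders, not names taken from the notebooks.

```python
import time
import torch

def timed_inference(model, modal_inputs):
    """Run one forward pass and return (output, elapsed seconds).

    `model` and `modal_inputs` are placeholders for whatever multi-modal model
    and per-modality tensors the notebook builds.
    """
    model.eval()
    start = time.time()
    with torch.no_grad():
        output = model(*modal_inputs)
    elapsed = time.time() - start
    return output, elapsed
```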
The file tree must look like this:
multimodalDLforER
|-checkpoints
| |-models
| | |-bodyabn_last.pth
| | |-contextabn_best.pth
| | |-facevgg_best.pth
| | \\-posedgcnn_ws_last.pth
| |-checkpoints
| | \\...
| |-thresholds
| | |-thresholds_validation.npy
| | \\...
| |-YOLO
| | |-YOLO-weights
| | | \\...
| | |-coco.names
| | \\...
| |-hrnet_w48_384x288.pth
| \\...
|-configs
| |-embracenet_plus.json
| \\...
|-EMOTIC
| \\...
|-models
| |-utils
| | \\..
| \\...
|-utils
| \\...
\\-processor.py
Usage: processor.py [-h] [-a UNIMODAL] [-t MODALITY] [-p PRETRAINED] [-n UNIMODEL] [-u UNIMODELS] [-m MULTIMODEL] [-o MODE] [-d DATASET] [-c CONFIGURATION] [-g CUDA] [-s SAVENAME] [-v OVERSAMPLE] [-i INPUTFILE] [-r THRESHOLD]

To train the multi-modal method:
python processor.py -u checkpoints/models/ -o train -d EMOTIC/ -c configs/embracenet_plus.json -g 0 -s checkpoints/ebnplus -v

To test it:
python processor.py -p -u checkpoints/models/ -m checkpoints/checkpoints/ebnplus/ebnp.pth -o test -d EMOTIC/ -c configs/embracenet_plus.json -g 0

To run inference on a single image:
python processor.py -p -u checkpoints/models/ -m checkpoints/checkpoints/ebnplus/ebnp.pth -o inference -c configs/embracenet_plus.json -g 0 -i img.png -r checkpoints/thresholds/thresholds_validation.npy
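For reference, here is a hypothetical argparse sketch consistent with the option list above. The flag letters and metavariables come from the usage string; the long option names, types, and help texts are assumptions rather than what processor.py actually defines.

```python
import argparse

# Hypothetical reconstruction of the CLI listed above; the real processor.py may differ.
parser = argparse.ArgumentParser(description="Train, test, or run the multi-modal emotion recognition method.")
parser.add_argument("-a", "--unimodal", help="work with a single modality instead of the fusion")
parser.add_argument("-t", "--modality", help="which modality to use together with -a")
parser.add_argument("-p", "--pretrained", action="store_true", help="load pretrained unimodal models")
parser.add_argument("-n", "--unimodel", help="path to a single unimodal checkpoint")
parser.add_argument("-u", "--unimodels", help="directory with the four unimodal checkpoints")
parser.add_argument("-m", "--multimodel", help="path to the multi-modal checkpoint")
parser.add_argument("-o", "--mode", choices=["train", "test", "inference"], help="operation to run")
parser.add_argument("-d", "--dataset", help="path to the EMOTIC data")
parser.add_argument("-c", "--configuration", help="JSON configuration file")
parser.add_argument("-g", "--cuda", help="GPU id to use")
parser.add_argument("-s", "--savename", help="where to save checkpoints")
parser.add_argument("-v", "--oversample", action="store_true", help="use the weighted random sampler")
parser.add_argument("-i", "--inputfile", help="input image for inference")
parser.add_argument("-r", "--threshold", help="decision thresholds file (.npy)")
args = parser.parse_args()
```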
If you use our code or models in your research, please cite:
@inproceedings{heredia2021multi,
title={A Multi-modal Visual Emotion Recognition Method to Instantiate an Ontology},
author={Heredia, Juan and Cardinale, Yudith and Dongo, Irvin and D{\'\i}az-Amado, Jose},
booktitle={16th International Conference on Software Technologies},
pages={453--464},
year={2021},
organization={SCITEPRESS-Science and Technology Publications}
}
This research was supported by the FONDO NACIONAL DE DESARROLLO CIENTÍFICO, TECNOLÓGICO Y DE INNOVACIÓN TECNOLÓGICA - FONDECYT, as executing entity of CONCYTEC, under grant agreement no. 01-2019-FONDECYT-BM-INC.INV, in the project RUTAS: Robots for Urban Tourism, Autonomous and Semantic web based.


