wagner-niklas / cage_expression_inference

Project to infer emotional expressions and benchmark datasets, by Niklas Wagner, Felix Mätzler, Samed R. Vossberg, Helen Schneider and Svetlana Pavlitska.

License: MIT License

Python 71.21% TeX 28.79%
affectnet circumplex-model emotic emotion-detection emotion-recognition face-detection facial-expression-recognition facial-recognition

cage_expression_inference's Introduction

Circumplex Affect Guided Expression Inference (CAGE)

Realtime Expression Inference Supported By The Circumplex Model

Keywords: User experience, Expression Inference, FER, Expression Recognition, Emotion Recognition, Supervised Learning, Computer Vision, Data Set Comparison, Autonomous driving

Citation

If you use this repository or any of its contents, please consider citing our paper: CAGE: Circumplex Affect Guided Expression Inference

@misc{wagner2024cage,
      title={CAGE: Circumplex Affect Guided Expression Inference}, 
      author={Niklas Wagner and Felix Mätzler and Samed R. Vossberg and Helen Schneider and Svetlana Pavlitska and J. Marius Zöllner},
      year={2024},
      eprint={2404.14975},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Abstract:

Understanding expressions and emotions is a task of interest across multiple disciplines, especially for improving user experiences. Contrary to the common perception, it has been shown that expressions are not discrete entities but instead exist along a continuum. People understand discrete expressions differently due to a variety of factors, including cultural background, individual experiences and cognitive biases. Therefore, most approaches to expression understanding, particularly those relying on discrete categories, are inherently biased. In this paper, we present a comparative in-depth analysis of two common datasets (AffectNet and EMOTIC) equipped with the components of the circumplex model of affect. Further, we propose a model for prediction of facial expression tailored for lightweight applications. Using a small-scaled MaxViT-based model architecture, we evaluate the impact of discrete expression category labels in training with the continuous valence and arousal labels. We show that considering valence and arousal in addition to discrete category labels helps to significantly improve expression prediction. The proposed model outperforms the current state-of-the-art models on AffectNet, establishing it as the best-performing model for inferring valence and arousal achieving a 7% lower RMSE.
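The abstract reports valence and arousal performance as RMSE (root mean squared error). As a reminder of what that metric measures, here is a minimal sketch in plain Python. The function names `rmse` and `relative_reduction` and all sample values are illustrative assumptions; they are not taken from the repository or the paper.

```python
import math


def rmse(predictions, targets):
    """Root mean squared error between two equal-length sequences."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if len(predictions) == 0:
        raise ValueError("at least one sample is required")
    squared_errors = [(p - t) ** 2 for p, t in zip(predictions, targets)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def relative_reduction(baseline, improved):
    """Fraction by which `improved` is lower than `baseline`."""
    return (baseline - improved) / baseline


# Made-up valence values, for illustration only (not real model output).
true_valence = [0.5, -0.2, 0.8, -0.6]
pred_valence = [0.4, -0.1, 0.6, -0.5]
print("valence RMSE:", rmse(pred_valence, true_valence))

# Hypothetical RMSE scores of two models, to show how a relative
# reduction such as "7% lower" would be computed.
print("relative reduction:", relative_reduction(0.400, 0.372))
```

A lower RMSE means the predicted valence or arousal values lie closer to the annotated ones on average, with large errors penalised more strongly than small ones.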

Model inference on a video:

Usage:

To run the version with our best-performing model, cd into the project directory and install the requirements:

pip install -r requirements.txt

If you want to train or alter the models, you can run one of the Python scripts in the directory. To run the training scripts, make sure you have the EMOTIC [1] and AffectNet [2] datasets downloaded and saved in the right directory. The datasets are not publicly available; access has to be requested (EMOTIC, 2019; AffectNet, 2017).

[1] R. Kosti, J.M. Álvarez, A. Recasens and A. Lapedriza, "Context based emotion recognition using emotic dataset", IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2019.

[2] Ali Mollahosseini, Behzad Hasani and Mohammad H. Mahoor, "AffectNet: A Database for Facial Expression, Valence, and Arousal Computing in the Wild," in IEEE Transactions on Affective Computing, vol. 10, no. 1, pp. 18-31, 1 Jan.-March 2019, doi: 10.1109/TAFFC.2017.2740923.
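Since the training scripts expect both datasets to be present, a quick check before starting a long run can save time. The following is only a sketch: the helper `missing_dataset_dirs`, the `data` root and the folder names `EMOTIC` and `AffectNet` are hypothetical, because the README does not state the expected layout. Adjust them to whatever paths the training scripts actually read.

```python
from pathlib import Path


def missing_dataset_dirs(root, names):
    """Return the entries of `names` that are not directories under `root`."""
    root = Path(root)
    return [name for name in names if not (root / name).is_dir()]


# Hypothetical layout: ./data/EMOTIC and ./data/AffectNet.
# These paths are assumptions, not taken from the repository.
missing = missing_dataset_dirs("data", ["EMOTIC", "AffectNet"])
if missing:
    print("Missing dataset folders:", ", ".join(missing))
else:
    print("All dataset folders found.")
```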

Tasks of this project:

[1] Implement live video inference of discrete expressions

[2] Extend code to estimate the continuous values of the circumplex model of affect

[3] Test model performance on AffectNet and EMOTIC

[4] Live test expression inference

[5] Research methods for validating and improving results for future work
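To make the continuous values in task [2] more concrete: the circumplex model places an affective state in a plane spanned by valence (horizontal axis) and arousal (vertical axis). The sketch below converts a (valence, arousal) pair into an angle and intensity on that plane and into a coarse quadrant description. The function names, the quadrant wording and the choice to count a value of exactly 0 as positive or high are assumptions made for illustration; this is not the repository's code.

```python
import math


def to_polar(valence, arousal):
    """Return (intensity, angle in degrees in [0, 360)) on the valence/arousal plane."""
    intensity = math.hypot(valence, arousal)
    angle = math.degrees(math.atan2(arousal, valence)) % 360.0
    return intensity, angle


def quadrant(valence, arousal):
    """Coarse quadrant description; zero counts as positive/high (an assumption)."""
    valence_part = "positive valence" if valence >= 0 else "negative valence"
    arousal_part = "high arousal" if arousal >= 0 else "low arousal"
    return f"{valence_part}, {arousal_part}"


# Made-up predictions, for illustration only.
for valence, arousal in [(0.7, 0.4), (-0.6, 0.5), (-0.3, -0.7), (0.5, -0.4)]:
    intensity, angle = to_polar(valence, arousal)
    print(f"v={valence:+.1f} a={arousal:+.1f} -> {quadrant(valence, arousal)}, "
          f"intensity={intensity:.2f}, angle={angle:.1f} deg")
```

A representation like this keeps the continuous information and still allows a rough discrete reading when one is needed, for example for an overlay on a live video stream.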

cage_expression_inference's People

Contributors

wagner-niklas, samedvossberg

cage_expression_inference's Issues

Video inference script

Thanks a lot for this wonderful work! I wanted to test the model on new videos/images. Could you please provide a script for live video expression inference?

Trained weights

Hi,
I am trying to use these models for valence estimation.
However, I am unable to find the trained weights.
Many scripts are loading the file best_model_affectnet_improved7VA, which is not part of the repository.
Where can I find that file?
Thanks.

Data Preprocessing

Hi @wagner-niklas,

I am trying to train the models to reproduce the results. I found that the images in the AffectNet dataset are keyed by number (as shown in *_set_annotation_without_lnd.csv). I wonder how to map the original images to these image numbers.
Would you please provide the code for preprocessing the image data, including renaming and processing? Thanks!

Best regards
