Coder Social home page Coder Social logo

gcunhase / emotional-video-to-audio-with-anfis-deeprnn Goto Github PK

View Code? Open in Web Editor NEW
25.0 4.0 6.0 2.63 MB

Emotional Video to Audio Transformation with ANFIS-DeepRNN (Vanilla RNN and LSTM-DeepRNN) [MPE 2020]

Home Page: https://www.hindawi.com/journals/mpe/2020/8478527/

MATLAB 96.03% HTML 3.95% Shell 0.02%
audio music-generation visual-features audio-features anfis emotion-labels lindsey-stirling-dataset deap deap-dataset matlab

emotional-video-to-audio-with-anfis-deeprnn's Introduction

About

Repository for paper titled "Emotional Video to Audio Transformation Using Deep Recurrent Neural Networks and a Neuro-Fuzzy System".

Contents

RequirementsDatasetHow to UseHow to Cite

Requirements

Matlab 2017, Mac OS

Toolboxes: Fuzzy Logic, Deep Learning

Dataset

Both datasets have emotion label in the 2D-axis (Valence and Arousal)

  • 8 music videos
  • Emotion labels: dataset/lindsey stirling dataset/user_response*.tsv
  • 38 music videos
  • Emotion labels: dataset/deap dataset/participant_ratings.csv

Model

  • Extract audio and visual features
  • ANFIS for emotion classification of visual features
  • Seq2Seq for audio feature generation (multi-modal domain transformation)
  • Mapping of audio features to audio snippets for music generation

How to Use

All the codes are for the Lindsey Stirling Dataset. The codes corresponding to the DEAP Dataset are also available.

  1. Change current folder to where this file is located

  2. Download datasets

  3. Extract audio and visual features

    • Extract sound features:
      scripts/emotion_from_sound/main_sound2feat_lindsey.m
      
    • Extract visual features:
      scripts/emotion_from_visual/main_video2feat_lindsey.m
      
  4. Train:

    • Settings and Load data:
      scripts/model/main_settings.m
      
    • ANFIS for emotion classification from HSL (visual features):
      scripts/model/main_anfis.m
      
    • Seq2Seq for domain transformation from visual to audio features:
      scripts/model/main_seq2seq_train.m
      
  5. Evaluation (music generation from visual features)

    • Extract sound features (test data):
      scripts/emotion_from_sound/main_sound2feat_lindsey_test_individual.m
      
    • Extract visual features (test data):
      scripts/emotion_from_visual/main_video2feat_lindsey_test_individual.m
      
    • Settings and Load data:
      scripts/model/main_settings.m
      
    • Eval
      scripts/model/main_anfis_seq2seq_test.m
      
  6. Evaluation of MTurk results in scripts/eval_mturk

Notes

Acknowledgement

In case you wish to use this code, please use the following citation:

@article{sergio2020mpe,
   AUTHOR={{Sergio, G. C., and Lee, M.}},
   TITLE={Emotional Video to Audio Transformation Using Deep Recurrent Neural Networks and a Neuro-Fuzzy System},
   JOURNAL={Mathematical Problems in Engineering},
   VOLUME={2020},
   PAGES={1--15},
   DOI={https://doi.org/10.1155/2020/8478527},
   YEAR={2020}
}

Contact: [email protected]

emotional-video-to-audio-with-anfis-deeprnn's People

Contributors

gcunhase avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.