Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality

Abstract

We present Affect2MM, a learning method for time-series emotion prediction in multimedia content. Our goal is to automatically capture the varying emotions depicted by characters in real-life, human-centric situations and behaviors. We draw on ideas from emotion causation theories to computationally model and determine the emotional state evoked by movie clips. Affect2MM explicitly models temporal causality using attention-based methods and Granger causality. We combine a variety of components, including the facial features of the actors involved, scene understanding, visual aesthetics, action/situation descriptions, and the movie script, to obtain an affect-rich representation of the scene, and use an LSTM-based model for emotion perception. To evaluate our method, we compare our performance on three datasets, SENDv1, MovieGraphs, and LIRIS-ACCEDE, and observe an average 10-15% improvement over SOTA methods on all three datasets.
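To make the pipeline the abstract describes more concrete, here is a minimal sketch (not the authors' released code): per-timestep multimodal features are fused into one vector, an LSTM models the clip over time, and a causal (past-only) attention step reweights earlier timesteps before a regression head predicts valence/arousal. All dimensions, names, and the fusion scheme are illustrative assumptions.

```python
import torch
import torch.nn as nn

class Affect2MMSketch(nn.Module):
    """Illustrative stand-in for the model described in the abstract."""
    def __init__(self, face_dim=128, scene_dim=256, text_dim=300, hidden=128):
        super().__init__()
        # Fuse concatenated modality features (face, scene, script, ...).
        self.fuse = nn.Linear(face_dim + scene_dim + text_dim, hidden)
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)
        self.attn = nn.Linear(hidden, 1)      # scores over hidden states
        self.head = nn.Linear(2 * hidden, 2)  # predicts (valence, arousal)

    def forward(self, face, scene, text):
        x = torch.relu(self.fuse(torch.cat([face, scene, text], dim=-1)))
        h, _ = self.lstm(x)                   # (B, T, hidden)
        B, T, H = h.shape
        # Causal attention: timestep t may only attend to steps <= t.
        scores = self.attn(h).squeeze(-1)     # (B, T)
        mask = torch.tril(torch.ones(T, T, device=h.device)).bool()
        scores = scores.unsqueeze(1).expand(B, T, T).masked_fill(~mask, float("-inf"))
        ctx = torch.softmax(scores, dim=-1) @ h   # (B, T, hidden)
        return self.head(torch.cat([h, ctx], dim=-1))  # (B, T, 2)

# Usage on dummy features for a 10-step clip:
model = Affect2MMSketch()
va = model(torch.randn(1, 10, 128), torch.randn(1, 10, 256), torch.randn(1, 10, 300))
print(va.shape)  # torch.Size([1, 10, 2])
```

Separately, the Granger-causality component can be illustrated with a standard statistical test of whether a feature stream helps predict the emotion stream; the arrays below are placeholders, not the paper's data or exact procedure.

```python
import numpy as np
from statsmodels.tsa.stattools import grangercausalitytests

emotion = np.random.randn(100)   # e.g., per-second valence annotations
feature = np.random.randn(100)   # e.g., a facial-feature signal over time
# Tests whether `feature` (2nd column) Granger-causes `emotion` (1st column).
grangercausalitytests(np.column_stack([emotion, feature]), maxlag=3)
```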

Cite

Text

Mittal et al. "Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00561

Markdown

[Mittal et al. "Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/mittal2021cvpr-affect2mm/) doi:10.1109/CVPR46437.2021.00561

BibTeX

@inproceedings{mittal2021cvpr-affect2mm,
  title     = {{Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality}},
  author    = {Mittal, Trisha and Mathur, Puneet and Bera, Aniket and Manocha, Dinesh},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {5661--5671},
  doi       = {10.1109/CVPR46437.2021.00561},
  url       = {https://mlanthology.org/cvpr/2021/mittal2021cvpr-affect2mm/}
}