Do Deepfakes Feel Emotions? A Semantic Approach to Detecting Deepfakes via Emotional Inconsistencies
Abstract
Recent advances in deep learning and computer vision have spawned a new class of media forgeries known as deepfakes, which typically consist of artificially generated human faces or voices. The creation and distribution of deepfakes raise many legal and ethical concerns. As a result, the ability to distinguish between deepfakes and authentic media is vital. While deepfakes can create plausible video and audio, it may be challenging for them to generate content that is consistent in terms of high-level semantic features, such as emotions. Unnatural displays of emotion, measured by features such as valence and arousal, can provide significant evidence that a video has been synthesized. In this paper, we propose a novel method for detecting deepfakes of a human speaker using the emotion predicted from the speaker’s face and voice. The proposed technique leverages Long Short-Term Memory (LSTM) networks that predict emotion from audio and video Low-Level Descriptors (LLDs). The emotion predicted over time is then used to classify videos as authentic or deepfake through an additional supervised classifier.
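The second stage described above, classifying a video from its predicted emotion over time, can be illustrated with a minimal sketch. It assumes the LSTM stage has already produced per-time-step (valence, arousal) predictions for the face and for the voice. The summary features (mean absolute gap and Pearson correlation between the two tracks), the nearest-centroid classifier, and all function names are illustrative assumptions, not the features or classifier used in the paper, and the toy tracks are made up.

```python
from statistics import mean, pstdev


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(x), mean(y)
    sx, sy = pstdev(x), pstdev(y)
    if sx == 0 or sy == 0:
        return 0.0
    cov = mean([(a - mx) * (b - my) for a, b in zip(x, y)])
    return cov / (sx * sy)


def inconsistency_features(face, voice):
    """Summarise how well two emotion tracks agree (hypothetical feature set).

    face, voice: aligned lists of (valence, arousal) predictions per time step.
    Returns [mean |valence gap|, mean |arousal gap|, valence corr, arousal corr].
    """
    if len(face) != len(voice) or len(face) < 2:
        raise ValueError("need two aligned tracks with at least two time steps")
    face_val, face_aro = zip(*face)
    voice_val, voice_aro = zip(*voice)
    return [
        mean([abs(a - b) for a, b in zip(face_val, voice_val)]),
        mean([abs(a - b) for a, b in zip(face_aro, voice_aro)]),
        pearson(face_val, voice_val),
        pearson(face_aro, voice_aro),
    ]


def fit_centroids(feature_vectors, labels):
    """Nearest-centroid stand-in for the supervised classifier: one mean vector per label."""
    grouped = {}
    for vec, label in zip(feature_vectors, labels):
        grouped.setdefault(label, []).append(vec)
    return {label: [mean(col) for col in zip(*vecs)] for label, vecs in grouped.items()}


def predict(centroids, vec):
    """Return the label whose centroid is closest to vec in squared Euclidean distance."""
    def sq_dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(vec, centroid))
    return min(centroids, key=lambda label: sq_dist(centroids[label]))


# Toy data (made up): emotion rising in one track, falling in the other.
rising = [(0.1, 0.2), (0.3, 0.4), (0.5, 0.6)]
falling = list(reversed(rising))
train_x = [inconsistency_features(rising, rising), inconsistency_features(rising, falling)]
train_y = ["authentic", "deepfake"]
centroids = fit_centroids(train_x, train_y)

query = inconsistency_features(rising, [(0.1, 0.25), (0.3, 0.4), (0.5, 0.55)])
print(predict(centroids, query))
```

In this toy setup, a video whose face and voice emotion tracks move together lands near the authentic centroid, while tracks that move in opposite directions land near the deepfake centroid. A real system would learn the decision rule from labelled videos rather than from two hand-built examples.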
Cite
Text
Hosler et al. "Do Deepfakes Feel Emotions? A Semantic Approach to Detecting Deepfakes via Emotional Inconsistencies." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2021. doi:10.1109/CVPRW53098.2021.00112
Markdown
[Hosler et al. "Do Deepfakes Feel Emotions? A Semantic Approach to Detecting Deepfakes via Emotional Inconsistencies." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2021.](https://mlanthology.org/cvprw/2021/hosler2021cvprw-deepfakes/) doi:10.1109/CVPRW53098.2021.00112
BibTeX
@inproceedings{hosler2021cvprw-deepfakes,
title = {{Do Deepfakes Feel Emotions? A Semantic Approach to Detecting Deepfakes via Emotional Inconsistencies}},
author = {Hosler, Brian C. and Salvi, Davide and Murray, Anthony and Antonacci, Fabio and Bestagini, Paolo and Tubaro, Stefano and Stamm, Matthew C.},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2021},
pages = {1013--1022},
doi = {10.1109/CVPRW53098.2021.00112},
url = {https://mlanthology.org/cvprw/2021/hosler2021cvprw-deepfakes/}
}