Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods

Phillips, Adam; Rodriguez, Daniel Grandes; Sánchez-Manzano, Miriam; Salvadó, Alan; Garin, Manuel; Haro, Gloria; Ballester, Coloma

doi:10.1007/978-3-031-91572-7_21

Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods

Adam Phillips, Daniel Grandes Rodriguez, Miriam Sánchez-Manzano, Alan Salvadó, Manuel Garin, Gloria Haro, Coloma Ballester

ECCVW 2024 pp. 345-361

doi:10.1007/978-3-031-91572-7_21 /eccvw/2024/phillips2024eccvw-visual/

Abstract

In cinema, visual motifs are recurrent iconographic compositions that carry artistic or aesthetic significance. Their use throughout the history of visual arts and media is interesting to researchers and filmmakers alike. Our goal in this work is to recognise and classify these motifs by proposing a new machine learning model that uses a custom dataset to that end. We show how features extracted from a CLIP model can be leveraged by using a shallow network and an appropriate loss to classify images into 20 different motifs, with surprisingly good results: an $F_1$ F 1 -score of 0.91 on our test set. We also present several ablation studies justifying the input features, architecture and hyperparameters used.

PDF ECCVW Semantic Scholar

Cite

Text

Phillips et al. "Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-91572-7_21

Markdown

[Phillips et al. "Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/phillips2024eccvw-visual/) doi:10.1007/978-3-031-91572-7_21

BibTeX

@inproceedings{phillips2024eccvw-visual,
  title     = {{Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods}},
  author    = {Phillips, Adam and Rodriguez, Daniel Grandes and Sánchez-Manzano, Miriam and Salvadó, Alan and Garin, Manuel and Haro, Gloria and Ballester, Coloma},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {345-361},
  doi       = {10.1007/978-3-031-91572-7_21},
  url       = {https://mlanthology.org/eccvw/2024/phillips2024eccvw-visual/}
}