Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments

Abstract

We present a novel method for few-shot video classification, which performs appearance and temporal alignments. In particular, given a pair of query and support videos, we conduct appearance alignment via frame-level feature matching to achieve the appearance similarity score between the videos, while utilizing temporal order-preserving priors for obtaining the temporal similarity score between the videos. Moreover, we introduce a few-shot video classification framework that leverages the above appearance and temporal similarity scores across multiple steps, namely prototype-based training and testing as well as inductive and transductive prototype refinement. To the best of our knowledge, our work is the first to explore transductive few-shot video classification. Extensive experiments on both Kinetics and Something-Something V2 datasets show that both appearance and temporal alignments are crucial for datasets with temporal order sensitivity such as Something-Something V2. Our approach achieves similar or better results than previous methods on both datasets.

Cite

Text

Nguyen et al. "Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-20044-1_27

Markdown

[Nguyen et al. "Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/nguyen2022eccv-inductive/) doi:10.1007/978-3-031-20044-1_27

BibTeX

@inproceedings{nguyen2022eccv-inductive,
  title     = {{Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments}},
  author    = {Nguyen, Khoi D. and Tran, Quoc-Huy and Nguyen, Khoi and Hua, Binh-Son and Nguyen, Rang},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-20044-1_27},
  url       = {https://mlanthology.org/eccv/2022/nguyen2022eccv-inductive/}
}