Unsupervised Few-Shot Action Recognition via Action-Appearance Aligned Meta-Adaptation
Abstract
We present MetaUVFS as the first Unsupervised Meta-learning algorithm for Video Few-Shot action recognition. MetaUVFS leverages over 550K unlabeled videos to train a two-stream 2D and 3D CNN architecture via contrastive learning to capture the appearance-specific spatial and action-specific spatio-temporal video features respectively. MetaUVFS comprises a novel Action-Appearance Aligned Meta-adaptation (A3M) module that learns to focus on the action-oriented video features in relation to the appearance features via explicit few-shot episodic meta-learning over unsupervised hard-mined episodes. Our action-appearance alignment and explicit few-shot learner conditions the unsupervised training to mimic the downstream few-shot task, enabling MetaUVFS to significantly outperform all unsupervised methods on few-shot benchmarks. Moreover, unlike previous few-shot action recognition methods that are supervised, MetaUVFS needs neither base-class labels nor a supervised pretrained backbone. Thus, we need to train MetaUVFS just once to perform competitively or sometimes even outperform state-of-the-art supervised methods on popular HMDB51, UCF101, and Kinetics100 few-shot datasets.
Cite
Text
Patravali et al. "Unsupervised Few-Shot Action Recognition via Action-Appearance Aligned Meta-Adaptation." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.00837Markdown
[Patravali et al. "Unsupervised Few-Shot Action Recognition via Action-Appearance Aligned Meta-Adaptation." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/patravali2021iccv-unsupervised/) doi:10.1109/ICCV48922.2021.00837BibTeX
@inproceedings{patravali2021iccv-unsupervised,
title = {{Unsupervised Few-Shot Action Recognition via Action-Appearance Aligned Meta-Adaptation}},
author = {Patravali, Jay and Mittal, Gaurav and Yu, Ye and Li, Fuxin and Chen, Mei},
booktitle = {International Conference on Computer Vision},
year = {2021},
pages = {8484-8494},
doi = {10.1109/ICCV48922.2021.00837},
url = {https://mlanthology.org/iccv/2021/patravali2021iccv-unsupervised/}
}