Learning from Weakly-Labeled Web Videos via Exploring Sub-Concepts

Abstract

Learning visual knowledge from massive weakly-labeled web videos has attracted growing research interests thanks to the large corpus of easily accessible video data on the Internet. However, for video action recognition, the action of interest might only exist in arbitrary clips of untrimmed web videos, resulting in high label noises in the temporal space. To address this challenge, we introduce a new method for pre-training video action recognition models using queried web videos. Instead of trying to filter out potential noises, we propose to provide fine-grained supervision signals by defining the concept of Sub-Pseudo Label (SPL). Specifically, SPL spans out a new set of meaningful "middle ground" label space constructed by extrapolating the original weak labels during video querying and the prior knowledge distilled from a teacher model. Consequently, SPL provides enriched supervision for video models to learn better representations and improves data utilization efficiency of untrimmed videos. We validate the effectiveness of our method on four video action recognition datasets and a weakly-labeled image dataset. Experiments show that SPL outperforms several existing pre-training strategies and the learned representations lead to competitive results on several benchmarks.

Cite

Text

Li et al. "Learning from Weakly-Labeled Web Videos via Exploring Sub-Concepts." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I2.20022

Markdown

[Li et al. "Learning from Weakly-Labeled Web Videos via Exploring Sub-Concepts." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/li2022aaai-learning-a/) doi:10.1609/AAAI.V36I2.20022

BibTeX

@inproceedings{li2022aaai-learning-a,
  title     = {{Learning from Weakly-Labeled Web Videos via Exploring Sub-Concepts}},
  author    = {Li, Kunpeng and Zhang, Zizhao and Wu, Guanhang and Xiong, Xuehan and Lee, Chen-Yu and Lu, Zhichao and Fu, Yun and Pfister, Tomas},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {1341-1349},
  doi       = {10.1609/AAAI.V36I2.20022},
  url       = {https://mlanthology.org/aaai/2022/li2022aaai-learning-a/}
}