Semi-Supervised Active Learning for Video Action Detection

Singh, Ayush; Rana, Aayush Jung; Kumar, Akash; Vyas, Shruti; Rawat, Yogesh Singh

doi:10.1609/AAAI.V38I5.28292

Semi-Supervised Active Learning for Video Action Detection

Ayush Singh, Aayush Jung Rana, Akash Kumar, Shruti Vyas, Yogesh Singh Rawat

AAAI 2024 pp. 4891-4899

doi:10.1609/AAAI.V38I5.28292 /aaai/2024/singh2024aaai-semi/

Abstract

In this work, we focus on label efficient learning for video action detection. We develop a novel semi-supervised active learning approach which utilizes both labeled as well as un- labeled data along with informative sample selection for ac- tion detection. Video action detection requires spatio-temporal localization along with classification, which poses several challenges for both active learning (informative sample se- lection) as well as semi-supervised learning (pseudo label generation). First, we propose NoiseAug, a simple augmenta- tion strategy which effectively selects informative samples for video action detection. Next, we propose fft-attention, a novel technique based on high-pass filtering which enables effective utilization of pseudo label for SSL in video action detection by emphasizing on relevant activity region within a video. We evaluate the proposed approach on three different bench- mark datasets, UCF-101-24, JHMDB-21, and Youtube-VOS. First, we demonstrate its effectiveness on video action detec- tion where the proposed approach outperforms prior works in semi-supervised and weakly-supervised learning along with several baseline approaches in both UCF101-24 and JHMDB- 21. Next, we also show its effectiveness on Youtube-VOS for video object segmentation demonstrating its generalization capability for other dense prediction tasks in videos.

PDF AAAI Semantic Scholar

Cite

Text

Singh et al. "Semi-Supervised Active Learning for Video Action Detection." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I5.28292

Markdown

[Singh et al. "Semi-Supervised Active Learning for Video Action Detection." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/singh2024aaai-semi/) doi:10.1609/AAAI.V38I5.28292

BibTeX

@inproceedings{singh2024aaai-semi,
  title     = {{Semi-Supervised Active Learning for Video Action Detection}},
  author    = {Singh, Ayush and Rana, Aayush Jung and Kumar, Akash and Vyas, Shruti and Rawat, Yogesh Singh},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {4891-4899},
  doi       = {10.1609/AAAI.V38I5.28292},
  url       = {https://mlanthology.org/aaai/2024/singh2024aaai-semi/}
}