AutoVideo: An Automated Video Action Recognition System

Abstract

Action recognition is an important task for video understanding with broad applications. However, developing an effective action recognition solution often requires extensive engineering efforts in building and testing different combinations of the modules and their hyperparameters. In this demo, we present AutoVideo, a Python system for automated video action recognition. AutoVideo is featured for 1) highly modular and extendable infrastructure following the standard pipeline language, 2) an exhaustive list of primitives for pipeline construction, 3) data-driven tuners to save the efforts of pipeline tuning, and 4) easy-to-use Graphical User Interface (GUI). AutoVideo is released under MIT license at https://github.com/datamllab/autovideo

Cite

Text

Zha et al. "AutoVideo: An Automated Video Action Recognition System." International Joint Conference on Artificial Intelligence, 2022. doi:10.24963/IJCAI.2022/862

Markdown

[Zha et al. "AutoVideo: An Automated Video Action Recognition System." International Joint Conference on Artificial Intelligence, 2022.](https://mlanthology.org/ijcai/2022/zha2022ijcai-autovideo/) doi:10.24963/IJCAI.2022/862

BibTeX

@inproceedings{zha2022ijcai-autovideo,
  title     = {{AutoVideo: An Automated Video Action Recognition System}},
  author    = {Zha, Daochen and Bhat, Zaid Pervaiz and Chen, Yi-Wei and Wang, Yicheng and Ding, Sirui and Chen, Jiaben and Lai, Kwei-Herng and Bhat, Mohammad Qazim and Jain, Anmoll Kumar and Costilla-Reyes, Alfredo and Zou, Na and Hu, Xia},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {5952-5955},
  doi       = {10.24963/IJCAI.2022/862},
  url       = {https://mlanthology.org/ijcai/2022/zha2022ijcai-autovideo/}
}