DIY Human Action Dataset Generation

Abstract

The recent successes in applying deep learning techniques to solve standard computer vision problems has aspired researchers to propose new computer vision problems in different domains. As previously established in the field, training data itself plays a significant role in the machine learning process, especially deep learning approaches which are data hungry. In order to solve each new problem and get a decent performance, a large amount of data needs to be captured which may in many cases pose logistical difficulties. Therefore, the ability to generate de novo data or expand an existing dataset, however small, in order to satisfy data requirement of current networks may be invaluable. Herein, we introduce a novel way to partition an action video clip into action, subject and context. Each part is manipulated separately and reassembled with our proposed video generation technique. Furthermore, our novel human skeleton trajectory generation along with our proposed video generation technique, enables us to generate unlimited action recognition training data. These techniques enables us to generate video action clips from an small set without costly and time-consuming data acquisition. Lastly, we prove through extensive set of experiments on two small human action recognition datasets, that this new data generation technique can improve the performance of current action recognition neural nets.

Cite

Text

Khodabandeh et al. "DIY Human Action Dataset Generation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018. doi:10.1109/CVPRW.2018.00194

Markdown

[Khodabandeh et al. "DIY Human Action Dataset Generation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018.](https://mlanthology.org/cvprw/2018/khodabandeh2018cvprw-diy/) doi:10.1109/CVPRW.2018.00194

BibTeX

@inproceedings{khodabandeh2018cvprw-diy,
  title     = {{DIY Human Action Dataset Generation}},
  author    = {Khodabandeh, Mehran and Joze, Hamid Reza Vaezi and Zharkov, Ilya and Pradeep, Vivek},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2018},
  pages     = {1448-1458},
  doi       = {10.1109/CVPRW.2018.00194},
  url       = {https://mlanthology.org/cvprw/2018/khodabandeh2018cvprw-diy/}
}