Automatically Labeling Video Data Using Multi-Class Active Learning

Abstract

Labeling video data is an essential prerequisite for many vision applications that depend on training data, such as visual information retrieval, object recognition, and human activity modelling. However, manually creating labels is not only time-consuming but also subject to human errors, and eventually, becomes impossible for a very large amount of data (e.g. 24/7 surveillance video). To minimize the human effort in labeling, we propose a unified multiclass active learning approach for automatically labeling video data. We include extending active learning from binary classes to multiple classes and evaluating several practical sample selection strategies. The experimental results show that the proposed approach works effectively even with a significantly reduced amount of labeled data. The best sample selection strategy can achieve more than a 50% error reduction over random sample selection.

Cite

Text

Yan et al. "Automatically Labeling Video Data Using Multi-Class Active Learning." IEEE/CVF International Conference on Computer Vision, 2003. doi:10.1109/ICCV.2003.1238391

Markdown

[Yan et al. "Automatically Labeling Video Data Using Multi-Class Active Learning." IEEE/CVF International Conference on Computer Vision, 2003.](https://mlanthology.org/iccv/2003/yan2003iccv-automatically/) doi:10.1109/ICCV.2003.1238391

BibTeX

@inproceedings{yan2003iccv-automatically,
  title     = {{Automatically Labeling Video Data Using Multi-Class Active Learning}},
  author    = {Yan, Rong and Yang, Jie and Hauptmann, Alexander G.},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2003},
  pages     = {516-523},
  doi       = {10.1109/ICCV.2003.1238391},
  url       = {https://mlanthology.org/iccv/2003/yan2003iccv-automatically/}
}