Generating Reliable Video Annotations by Exploiting the Crowd

Abstract

In computer vision and machine learning, the availability of annotated datasets is of crucial importance for both learning and performance evaluation. However, annotating visual datasets is a tedious and error-prone task and computer vision researchers usually dedicate a large amount of their time for collecting and generating annotations, which most of the time cannot be re-used in other scenarios. In this paper, we propose a simple, but effective, interactive video object segmentation method exploiting large noisy data gathered from crowd of users while playing a web game. Experimental results, carried out on two challenging video benchmarks, show how it is possible to generate reliable object segmentations in videos with a small human effort, achieving an accuracy comparable to the one obtained with manually-labeled annotations and also outperforming state-of-the-art video object segmentation approaches.

Cite

Text

Di Salvo et al. "Generating Reliable Video Annotations by Exploiting the Crowd." IEEE/CVF Winter Conference on Applications of Computer Vision, 2016. doi:10.1109/WACV.2016.7477718

Markdown

[Di Salvo et al. "Generating Reliable Video Annotations by Exploiting the Crowd." IEEE/CVF Winter Conference on Applications of Computer Vision, 2016.](https://mlanthology.org/wacv/2016/salvo2016wacv-generating/) doi:10.1109/WACV.2016.7477718

BibTeX

@inproceedings{salvo2016wacv-generating,
  title     = {{Generating Reliable Video Annotations by Exploiting the Crowd}},
  author    = {Di Salvo, Roberto and Spampinato, Concetto and Giordano, Daniela},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2016},
  pages     = {1-8},
  doi       = {10.1109/WACV.2016.7477718},
  url       = {https://mlanthology.org/wacv/2016/salvo2016wacv-generating/}
}