Training Object Class Detectors from Eye Tracking Data

Abstract

Training an object class detector typically requires a large set of images annotated with bounding-boxes, which is expensive and time-consuming to create. We propose a novel approach to annotate object locations which can substantially reduce annotation time. We first track the eye movements of annotators instructed to find the object and then propose a technique for deriving object bounding-boxes from these fixations. To validate our idea, we collected eye tracking data for the trainval part of 10 object classes of Pascal VOC 2012 (6,270 images, 5 observers). Our technique correctly produces bounding-boxes in 50% of the images, while reducing the total annotation time by a factor of 6.8× compared to drawing bounding-boxes. Any standard object class detector can be trained on the bounding-boxes predicted by our model. Our large-scale eye tracking dataset is available at groups.inf.ed.ac.uk/calvin/eyetrackdataset/ .

Cite

Text

Papadopoulos et al. "Training Object Class Detectors from Eye Tracking Data." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-10602-1_24

Markdown

[Papadopoulos et al. "Training Object Class Detectors from Eye Tracking Data." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/papadopoulos2014eccv-training/) doi:10.1007/978-3-319-10602-1_24

BibTeX

@inproceedings{papadopoulos2014eccv-training,
  title     = {{Training Object Class Detectors from Eye Tracking Data}},
  author    = {Papadopoulos, Dim P. and Clarke, Alasdair D. F. and Keller, Frank and Ferrari, Vittorio},
  booktitle = {European Conference on Computer Vision},
  year      = {2014},
  pages     = {361-376},
  doi       = {10.1007/978-3-319-10602-1_24},
  url       = {https://mlanthology.org/eccv/2014/papadopoulos2014eccv-training/}
}