Active Vision Dataset Benchmark

Abstract

Several recent efforts in computer vision indicate a trend toward studying and understanding problems in larger scale environments, beyond single images, and focus on connections to tasks in navigation, mobile manipulation, and visual question answering. A common goal of these tasks is the capability of moving in the environment, acquiring novel views during perception and while performing a task. This capability comes easily in synthetic environments, however achieving the same effect with real images is much more laborious. We propose using the existing Active Vision Dataset to form a benchmark for such problems in a real-world settings with real images. The dataset is well suited for evaluating tasks of multiview active recognition, target driven navigation, and target search, and also can be effective for studying the transfer of strategies learned in simulation to real settings.

Cite

Text

Ammirato et al. "Active Vision Dataset Benchmark." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018. doi:10.1109/CVPRW.2018.00277

Markdown

[Ammirato et al. "Active Vision Dataset Benchmark." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018.](https://mlanthology.org/cvprw/2018/ammirato2018cvprw-active/) doi:10.1109/CVPRW.2018.00277

BibTeX

@inproceedings{ammirato2018cvprw-active,
  title     = {{Active Vision Dataset Benchmark}},
  author    = {Ammirato, Phil and Berg, Alexander C. and Kosecka, Jana},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2018},
  pages     = {2046-2049},
  doi       = {10.1109/CVPRW.2018.00277},
  url       = {https://mlanthology.org/cvprw/2018/ammirato2018cvprw-active/}
}