What Object Should I Use? - Task Driven Object Detection

Abstract

When humans have to solve everyday tasks, they simply pick the objects that are most suitable. While the question which object should one use for a specific task sounds trivial for humans, it is very difficult to answer for robots or other autonomous systems. This issue, however, is not addressed by current benchmarks for object detection that focus on detecting object categories. We therefore introduce the COCO-Tasks dataset which comprises about 40,000 images where the most suitable objects for 14 tasks have been annotated. We furthermore propose an approach that detects the most suitable objects for a given task. The approach builds on a Gated Graph Neural Network to exploit the appearance of each object as well as the global context of all present objects in the scene. In our experiments, we show that the proposed approach outperforms other approaches that are evaluated on the dataset like classification or ranking approaches.

Cite

Text

Sawatzky et al. "What Object Should I Use? - Task Driven Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00779

Markdown

[Sawatzky et al. "What Object Should I Use? - Task Driven Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/sawatzky2019cvpr-object/) doi:10.1109/CVPR.2019.00779

BibTeX

@inproceedings{sawatzky2019cvpr-object,
  title     = {{What Object Should I Use? - Task Driven Object Detection}},
  author    = {Sawatzky, Johann and Souri, Yaser and Grund, Christian and Gall, Jurgen},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2019},
  doi       = {10.1109/CVPR.2019.00779},
  url       = {https://mlanthology.org/cvpr/2019/sawatzky2019cvpr-object/}
}