Commands 4 Autonomous Vehicles (C4AV) Workshop Summary

Abstract

The task of visual grounding requires locating the most relevant region or object in an image, given a natural language query. So far, progress on this task has mostly been measured on curated datasets, which are not always representative of human spoken language. In this work, we deviate from recent, popular task settings and consider the problem under an autonomous vehicle scenario. In particular, we consider a situation where passengers can give free-form natural language commands to a vehicle, where each command can be associated with an object in the street scene. To stimulate research on this topic, we have organized the Commands for Autonomous Vehicles (C4AV) challenge based on the recent Talk2Car dataset. This paper presents the results of the challenge. First, we compare the benchmark used against existing datasets for visual grounding. Second, we identify the aspects that render top-performing models successful and relate them to existing state-of-the-art models for visual grounding; we also detect potential failure cases by evaluating on carefully selected subsets. Finally, we discuss several possibilities for future work.

Cite

Text

Deruyttere et al. "Commands 4 Autonomous Vehicles (C4AV) Workshop Summary." European Conference on Computer Vision Workshops, 2020. doi:10.1007/978-3-030-66096-3_1

Markdown

[Deruyttere et al. "Commands 4 Autonomous Vehicles (C4AV) Workshop Summary." European Conference on Computer Vision Workshops, 2020.](https://mlanthology.org/eccvw/2020/deruyttere2020eccvw-commands/) doi:10.1007/978-3-030-66096-3_1

BibTeX

@inproceedings{deruyttere2020eccvw-commands,
  title     = {{Commands 4 Autonomous Vehicles (C4AV) Workshop Summary}},
  author    = {Deruyttere, Thierry and Vandenhende, Simon and Grujicic, Dusan and Liu, Yu and Van Gool, Luc and Blaschko, Matthew B. and Tuytelaars, Tinne and Moens, Marie-Francine},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2020},
  pages     = {3--26},
  doi       = {10.1007/978-3-030-66096-3_1},
  url       = {https://mlanthology.org/eccvw/2020/deruyttere2020eccvw-commands/}
}