Explaining Autonomous Driving by Learning End-to-End Visual Attention
Abstract
Current deep learning based autonomous driving approaches yield impressive results also leading to inproduction deployment in certain controlled scenarios. One of the most popular and fascinating approaches relies on learning vehicle controls directly from data perceived by sensors. This end-to-end learning paradigm can be applied both in classical supervised settings and using reinforcement learning. Nonetheless the main drawback of this approach as also in other learning problems is the lack of ex- plainability. Indeed, a deep network will act as a black-box outputting predictions depending on previously seen driving patterns without giving any feedback on why such decisions were taken.While to obtain optimal performance it is not critical to obtain explainable outputs from a learned agent, especially in such a safety critical field, it is of paramount importance to understand how the network behaves. This is particularly relevant to interpret failures of such systems.In this work we propose to train an imitation learning based agent equipped with an attention model. The attention model allows us to understand what part of the image has been deemed most important. Interestingly, the use of attention also leads to superior performance in a standard benchmark using the CARLA driving simulator.
Cite
Text
Cultrera et al. "Explaining Autonomous Driving by Learning End-to-End Visual Attention." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020. doi:10.1109/CVPRW50498.2020.00178Markdown
[Cultrera et al. "Explaining Autonomous Driving by Learning End-to-End Visual Attention." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020.](https://mlanthology.org/cvprw/2020/cultrera2020cvprw-explaining/) doi:10.1109/CVPRW50498.2020.00178BibTeX
@inproceedings{cultrera2020cvprw-explaining,
title = {{Explaining Autonomous Driving by Learning End-to-End Visual Attention}},
author = {Cultrera, Luca and Seidenari, Lorenzo and Becattini, Federico and Pala, Pietro and Del Bimbo, Alberto},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2020},
pages = {1389-1398},
doi = {10.1109/CVPRW50498.2020.00178},
url = {https://mlanthology.org/cvprw/2020/cultrera2020cvprw-explaining/}
}