CAR-Net: Clairvoyant Attentive Recurrent Network

Abstract

We present an interpretable framework for path prediction that leverages dependencies between agents' behaviors and their spatial navigation environment. We exploit two sources of information: the past motion trajectory of the agent of interest and a wide top-view image of the navigation scene. We propose a Clairvoyant Attentive Recurrent Network (CAR-Net) that learns where to look in a large image of the scene when solving the path prediction task. Our method can attend to any area, or a combination of areas, within the raw image (e.g., road intersections) when predicting the trajectory of the agent. This allows us to visualize fine-grained semantic elements of navigation scenes that influence the prediction of trajectories. To study the impact of space on agents' trajectories, we build a new dataset made of top-view images of hundreds of scenes (Formula One racing tracks) where agents' behaviors are heavily influenced by known areas in the images (e.g., upcoming turns). CAR-Net successfully attends to these salient regions. Additionally, CAR-Net reaches state-of-the-art accuracy on the standard trajectory forecasting benchmark, Stanford Drone Dataset (SDD). Finally, we show CAR-Net's ability to generalize to unseen scenes.

Cite

Text

Sadeghian et al. "CAR-Net: Clairvoyant Attentive Recurrent Network." Proceedings of the European Conference on Computer Vision (ECCV), 2018. doi:10.1007/978-3-030-01252-6_10

Markdown

[Sadeghian et al. "CAR-Net: Clairvoyant Attentive Recurrent Network." Proceedings of the European Conference on Computer Vision (ECCV), 2018.](https://mlanthology.org/eccv/2018/sadeghian2018eccv-carnet/) doi:10.1007/978-3-030-01252-6_10

BibTeX

@inproceedings{sadeghian2018eccv-carnet,
  title     = {{CAR-Net: Clairvoyant Attentive Recurrent Network}},
  author    = {Sadeghian, Amir and Legros, Ferdinand and Voisin, Maxime and Vesel, Ricky and Alahi, Alexandre and Savarese, Silvio},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2018},
  doi       = {10.1007/978-3-030-01252-6_10},
  url       = {https://mlanthology.org/eccv/2018/sadeghian2018eccv-carnet/}
}