Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction

Abstract

3D bounding boxes are a widespread intermediate representation in many computer vision applications. However, predicting them is a challenging task, largely due to partial observability, which motivates the need for a strong sense of uncertainty. While many recent methods have explored better architectures for consuming sparse and unstructured point cloud data, we hypothesize that there is room for improvement in the modeling of the output distribution and explore how this can be achieved using an autoregressive prediction head. Additionally, we release a simulated dataset, COB-3D, which highlights new types of ambiguity that arise in real-world robotics applications, where 3D bounding box prediction has largely been underexplored. We propose methods for leveraging our autoregressive model to make high confidence predictions and meaningful uncertainty measures, achieving strong results on SUN-RGBD, Scannet, KITTI, and our new dataset. Code and dataset are available at https://bbox.yuxuanliu.com.

Cite

Text

Liu et al. "Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-20080-9_39

Markdown

[Liu et al. "Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/liu2022eccv-autoregressive/) doi:10.1007/978-3-031-20080-9_39

BibTeX

@inproceedings{liu2022eccv-autoregressive,
  title     = {{Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction}},
  author    = {Liu, YuXuan and Mishra, Nikhil and Sieb, Maximilian and Shentu, Yide and Abbeel, Pieter and Chen, Xi},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-20080-9_39},
  url       = {https://mlanthology.org/eccv/2022/liu2022eccv-autoregressive/}
}