Stereo Neural Vernier Caliper

Abstract

We propose a new object-centric framework for learning-based stereo 3D object detection. Previous studies build scene-centric representations that do not consider the significant variation among outdoor instances and thus lack the flexibility and functionalities that an instance-level model can offer. We build such an instance-level model by formulating and tackling a local update problem, i.e., how to predict a refined update given an initial 3D cuboid guess. We demonstrate how solving this problem can complement scene-centric approaches in (i) building a coarse-to-fine multi-resolution system, (ii) performing model-agnostic object location refinement, and (iii) conducting stereo 3D tracking-by-detection. Extensive experiments demonstrate the effectiveness of our approach, which achieves state-of-the-art performance on the KITTI benchmark. Code and pre-trained models are available at https://github.com/Nicholasli1995/SNVC.

Cite

Text

Li et al. "Stereo Neural Vernier Caliper." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I2.20026

Markdown

[Li et al. "Stereo Neural Vernier Caliper." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/li2022aaai-stereo/) doi:10.1609/AAAI.V36I2.20026

BibTeX

@inproceedings{li2022aaai-stereo,
  title     = {{Stereo Neural Vernier Caliper}},
  author    = {Li, Shichao and Liu, Zechun and Shen, Zhiqiang and Cheng, Kwang-Ting},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {1376-1385},
  doi       = {10.1609/AAAI.V36I2.20026},
  url       = {https://mlanthology.org/aaai/2022/li2022aaai-stereo/}
}