DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection

Abstract

Many point-based 3D detectors adopt point-feature sampling strategies to drop some points for efficient inference. These strategies are typically based on fixed and handcrafted rules, making it difficult to handle complicated scenes. Different from them, we propose a Dynamic Ball Query (DBQ) network to adaptively select a subset of input points according to the input features, and assign the feature transform with a suitable receptive field for each selected point. It can be embedded into some state-of-the-art 3D detectors and trained in an end-to-end manner, which significantly reduces the computational cost. Extensive experiments demonstrate that our method can reduce latency by 30%-100% on KITTI, Waymo, and ONCE datasets. Specifically, the inference speed of our detector can reach 162 FPS on KITTI scene, and 30 FPS on Waymo and ONCE scenes without performance degradation. Due to skipping the redundant points, some evaluation metrics show significant improvements.

Cite

Text

Yang et al. "DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection." International Conference on Learning Representations, 2023.

Markdown

[Yang et al. "DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection." International Conference on Learning Representations, 2023.](https://mlanthology.org/iclr/2023/yang2023iclr-dbqssd/)

BibTeX

@inproceedings{yang2023iclr-dbqssd,
  title     = {{DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection}},
  author    = {Yang, Jinrong and Song, Lin and Liu, Songtao and Mao, Weixin and Li, Zeming and Li, Xiaoping and Sun, Hongbin and Sun, Jian and Zheng, Nanning},
  booktitle = {International Conference on Learning Representations},
  year      = {2023},
  url       = {https://mlanthology.org/iclr/2023/yang2023iclr-dbqssd/}
}