LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds

Abstract

A major bottleneck to scaling-up training of self-driving perception systems are the human annotations required for supervision. A promising alternative is to leverage “auto-labelling" offboard perception models that are trained to automatically generate annotations from raw LiDAR point clouds at a fraction of the cost. Auto-labels are most commonly generated via a two-stage approach – first objects are detected and tracked over time, and then each object trajectory is passed to a learned refinement model to improve accuracy. Since existing refinement models are overly complex and lack advanced temporal reasoning capabilities, in this work we propose LabelFormer, a simple, efficient, and effective trajectory-level refinement approach. Our approach first encodes each frame’s observations separately, then exploits self-attention to reason about the trajectory with full temporal context, and finally decodes the refined object size and per-frame poses. Evaluation on both urban and highway datasets demonstrates that LabelFormer outperforms existing works by a large margin. Finally, we show that training on a dataset augmented with auto-labels generated by our method leads to improved downstream detection performance compared to existing methods. Please visit the project website for details https://waabi.ai/labelformer/.

Cite

Text

Yang et al. "LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds." Conference on Robot Learning, 2023.

Markdown

[Yang et al. "LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds." Conference on Robot Learning, 2023.](https://mlanthology.org/corl/2023/yang2023corl-labelformer/)

BibTeX

@inproceedings{yang2023corl-labelformer,
  title     = {{LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds}},
  author    = {Yang, Anqi Joyce and Casas, Sergio and Dvornik, Nikita and Segal, Sean and Xiong, Yuwen and Hu, Jordan Sir Kwang and Fang, Carter and Urtasun, Raquel},
  booktitle = {Conference on Robot Learning},
  year      = {2023},
  pages     = {3364-3383},
  volume    = {229},
  url       = {https://mlanthology.org/corl/2023/yang2023corl-labelformer/}
}