FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds

Abstract

Estimating scene flow from real-world point clouds is a fundamental task for practical 3D vision. Previous methods often rely on deep models that first extract expensive per-point features at full resolution and then obtain the flow through either a complex matching mechanism or feature decoding, incurring high computational cost and latency. In this work, we propose a fast hierarchical network, FH-Net, which directly computes the flow of key points through a lightweight Trans-flow layer that exploits a reliable local geometry prior, and optionally back-propagates the computed sparse flows through an inverse Trans-up layer to obtain hierarchical flows at different resolutions. To focus more on challenging dynamic objects, we also provide a new copy-and-paste data augmentation technique based on generating dynamic object pairs. Moreover, to alleviate the chronic shortage of real-world training data, we establish two new large-scale datasets for this field by collecting lidar-scanned point clouds from public autonomous driving datasets and annotating the collected data through novel pseudo-labeling. Extensive experiments on both public and proposed datasets show that our method outperforms prior state-of-the-art methods while running at least 7× faster, at 113 FPS. Code and data are released at https://github.com/pigtigger/FH-Net.
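The copy-and-paste augmentation is only named in the abstract, so the following is a minimal sketch of the general idea rather than the released implementation. It assumes an object is pasted into the first frame, moved by a made-up rigid motion (a yaw rotation about its centroid plus a translation), and pasted into the second frame, so that the displacement of each pasted point gives its flow label. The function name `paste_dynamic_object` and all of its parameters are hypothetical.

```python
import math


def paste_dynamic_object(frame1, frame2, flow1, obj_points, yaw, translation):
    """Paste one object into two consecutive frames as a moving pair.

    frame1, frame2: lists of (x, y, z) points for two consecutive scans.
    flow1: per-point flow vectors for frame1 (same length as frame1).
    obj_points: object points, already positioned in frame1 coordinates.
    yaw, translation: assumed rigid motion of the object between frames
        (rotation about the vertical axis through the object's centroid,
        followed by a translation). This motion model is an assumption.
    """
    obj_points = list(obj_points)
    n = len(obj_points)
    cx = sum(p[0] for p in obj_points) / n
    cy = sum(p[1] for p in obj_points) / n
    tx, ty, tz = translation
    c, s = math.cos(yaw), math.sin(yaw)

    # Apply the rigid motion to get the object's position in frame 2.
    moved = []
    for x, y, z in obj_points:
        dx, dy = x - cx, y - cy
        moved.append((cx + c * dx - s * dy + tx,
                      cy + s * dx + c * dy + ty,
                      z + tz))

    # The flow label of each pasted point is its displacement.
    obj_flow = [(m[0] - p[0], m[1] - p[1], m[2] - p[2])
                for p, m in zip(obj_points, moved)]

    return frame1 + obj_points, frame2 + moved, flow1 + obj_flow
```

A call such as `paste_dynamic_object(f1, f2, flow, car_points, 0.0, (1.0, 0.0, 0.0))` would add the same object to both frames, shifted one unit along x in the second. A practical version would also need to check for collisions with existing points and handle occlusion, which this sketch omits.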

Cite

Text

Ding et al. "FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19842-7_13

Markdown

[Ding et al. "FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/ding2022eccv-fhnet/) doi:10.1007/978-3-031-19842-7_13

BibTeX

@inproceedings{ding2022eccv-fhnet,
  title     = {{FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds}},
  author    = {Ding, Lihe and Dong, Shaocong and Xu, Tingfa and Xu, Xinli and Wang, Jie and Li, Jianan},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19842-7_13},
  url       = {https://mlanthology.org/eccv/2022/ding2022eccv-fhnet/}
}