FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds
Abstract
Estimating scene flow from real-world point clouds is a fundamental task for practical 3D vision. Previous methods often rely on deep models to first extract expensive per-point features at full resolution, and then get the flow either from complex matching mechanism or feature decoding, suffering high computational cost and latency. In this work, we propose a fast hierarchical network, FH-Net, which directly gets the key points flow through a lightweight Trans-flow layer utilizing the reliable local geometry prior, and optionally back-propagates the computed sparse flows through an inverse Trans-up layer to obtain hierarchical flows at different resolutions. To focus more on challenging dynamic objects, we also provide a new copy-and-paste data augmentation technique based on dynamic object pairs generation. Moreover, to alleviate the chronic shortage of real-world training data, we establish two new large-scale datasets to this field by collecting lidar-scanned point clouds from public autonomous driving datasets and annotating the collected data through novel pseudo-labeling. Extensive experiments on both public and proposed datasets show that our method outperforms prior state-of-the-arts while running at least 7× faster at 113 FPS. Code and data are released at https://github.com/pigtigger/FH-Net.
Cite
Text
Ding et al. "FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19842-7_13Markdown
[Ding et al. "FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/ding2022eccv-fhnet/) doi:10.1007/978-3-031-19842-7_13BibTeX
@inproceedings{ding2022eccv-fhnet,
title = {{FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds}},
author = {Ding, Lihe and Dong, Shaocong and Xu, Tingfa and Xu, Xinli and Wang, Jie and Li, Jianan},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-19842-7_13},
url = {https://mlanthology.org/eccv/2022/ding2022eccv-fhnet/}
}