I Can't Believe It's Not Scene Flow!
Abstract
State-of-the-art scene flow methods broadly fail to describe the motion of small objects, and existing evaluation protocols hide this failure by averaging over many points. To address this limitation, we propose Bucket Normalized EPE, a new class-aware and speed-normalized evaluation protocol that better contextualizes error comparisons between object types that move at vastly different speeds. In addition, we propose TrackFlow, a frustratingly simple supervised scene flow baseline that combines a high-quality 3D object detector (trained using standard class re-balancing techniques) with a simple Kalman filter-based tracker. Notably, TrackFlow achieves state-of-the-art performance on existing metrics and shows large improvements over prior work on our proposed metric. Our results highlight that scene flow evaluation must be class and speed aware, and supervised scene flow methods must address point-level class imbalances. Our evaluation toolkit and code is available on GitHub.
Cite
Text
Khatri et al. "I Can't Believe It's Not Scene Flow!." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72649-1_14Markdown
[Khatri et al. "I Can't Believe It's Not Scene Flow!." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/khatri2024eccv-believe/) doi:10.1007/978-3-031-72649-1_14BibTeX
@inproceedings{khatri2024eccv-believe,
title = {{I Can't Believe It's Not Scene Flow!}},
author = {Khatri, Ishan and Vedder, Kyle and Peri, Neehar and Ramanan, Deva and Hays, James},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-72649-1_14},
url = {https://mlanthology.org/eccv/2024/khatri2024eccv-believe/}
}