Removing Cost Volumes from Optical Flow Estimators

Abstract

Cost volumes are used in every modern optical flow estimator, but due to their computational and space complexity, they are often a limiting factor regarding both processing speed and the resolution of input frames. Motivated by our empirical observation that cost volumes lose their importance once all other network parts of, e.g., a RAFT-based pipeline have been sufficiently trained, we introduce a training strategy that allows removing the cost volume from optical flow estimators throughout training. This leads to significantly improved inference speed and reduced memory requirements. Using our training strategy, we create three different models covering different compute budgets. Our most accurate model reaches state-of-the-art accuracy while being 1.2xfaster and having a 6xlower memory footprint than comparable models; our fastest model is capable of processing Full HD frames at 20\,\mathrm FPS using only 500\,\mathrm MB of GPU memory.

Cite

Text

Kiefhaber et al. "Removing Cost Volumes from Optical Flow Estimators." International Conference on Computer Vision, 2025.

Markdown

[Kiefhaber et al. "Removing Cost Volumes from Optical Flow Estimators." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/kiefhaber2025iccv-removing/)

BibTeX

@inproceedings{kiefhaber2025iccv-removing,
  title     = {{Removing Cost Volumes from Optical Flow Estimators}},
  author    = {Kiefhaber, Simon and Roth, Stefan and Schaub-Meyer, Simone},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {79-89},
  url       = {https://mlanthology.org/iccv/2025/kiefhaber2025iccv-removing/}
}