Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching

Abstract

Dynamic stereo matching is the task of estimating consistent disparities from stereo videos with dynamic objects. Recent learning-based methods prioritize optimal performance on a single stereo pair, resulting in temporal inconsistencies. Existing video methods apply per-frame matching and window-based cost aggregation across the time dimension, leading to low-frequency oscillations at the scale of the window size. Towards this challenge, we develop a bidirectional alignment mechanism for adjacent frames as a fundamental operation. We further propose a novel framework, BiDAStereo, that achieves consistent dynamic stereo matching. Unlike the existing methods, we model this task as local matching and global aggregation. Locally, we consider correlation in a triple-frame manner to pool information from adjacent frames and improve the temporal consistency. Globally, to exploit the entire sequence’s consistency and extract dynamic scene cues for aggregation, we develop a motion-propagation recurrent unit. Extensive experiments demonstrate the performance of our method, showcasing improvements in prediction quality and achieving SoTA results on commonly used benchmarks.

Cite

Text

Jing et al. "Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73027-6_24

Markdown

[Jing et al. "Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/jing2024eccv-matchstereovideos/) doi:10.1007/978-3-031-73027-6_24

BibTeX

@inproceedings{jing2024eccv-matchstereovideos,
  title     = {{Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching}},
  author    = {Jing, Junpeng and Mao, Ye and Mikolajczyk, Krystian},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73027-6_24},
  url       = {https://mlanthology.org/eccv/2024/jing2024eccv-matchstereovideos/}
}