Robust Synthetic-to-Real Transfer for Stereo Matching
Abstract
With advancements in domain generalized stereo matching networks models pre-trained on synthetic data demonstrate strong robustness to unseen domains. However few studies have investigated the robustness after fine-tuning them in real-world scenarios during which the domain generalization ability can be seriously degraded. In this paper we explore fine-tuning stereo matching networks without compromising their robustness to unseen domains. Our motivation stems from comparing Ground Truth (GT) versus Pseudo Label (PL) for fine-tuning: GT degrades but PL preserves the domain generalization ability. Empirically we find the difference between GT and PL implies valuable information that can regularize networks during fine-tuning. We also propose a framework to utilize this difference for fine-tuning consisting of a frozen Teacher an exponential moving average (EMA) Teacher and a Student network. The core idea is to utilize the EMA Teacher to measure what the Student has learned and dynamically improve GT and PL for fine-tuning. We integrate our framework with state-of-the-art networks and evaluate its effectiveness on several real-world datasets. Extensive experiments show that our method effectively preserves the domain generalization ability during fine-tuning.
Cite
Text
Zhang et al. "Robust Synthetic-to-Real Transfer for Stereo Matching." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01914Markdown
[Zhang et al. "Robust Synthetic-to-Real Transfer for Stereo Matching." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/zhang2024cvpr-robust/) doi:10.1109/CVPR52733.2024.01914BibTeX
@inproceedings{zhang2024cvpr-robust,
title = {{Robust Synthetic-to-Real Transfer for Stereo Matching}},
author = {Zhang, Jiawei and Li, Jiahe and Huang, Lei and Yu, Xiaohan and Gu, Lin and Zheng, Jin and Bai, Xiao},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {20247-20257},
doi = {10.1109/CVPR52733.2024.01914},
url = {https://mlanthology.org/cvpr/2024/zhang2024cvpr-robust/}
}