Weakly Supervised Learning of Semantic Correspondence Through Cascaded Online Correspondence Refinement
Abstract
In this paper, we develop a weakly supervised learning algorithm to learn robust semantic correspondences from large-scale datasets with only image-level labels. Following the spirit of multiple instance learning (MIL), we decompose the weakly supervised correspondence learning problem into three stages: image-level matching, region-level matching, and pixel-level matching. We propose a novel cascaded online correspondence refinement algorithm that integrates MIL and the correspondence filtering and refinement procedure into a single deep network, and we train this network end-to-end with only image-level supervision, i.e., without point-to-point matching information. During the correspondence learning process, pixel-to-pixel matching pairs inferred from weak supervision are propagated, filtered, and enhanced through masked correspondence voting and calibration. In addition, we design a correspondence consistency check algorithm to select images with discriminative key points to generate pseudo-labels for classical matching algorithms. Finally, we select about 110,000 images from the ImageNet ILSVRC training set to form a new dataset, called SC-ImageNet. Experiments on several popular benchmarks indicate that pre-training on SC-ImageNet can efficiently improve the performance of state-of-the-art algorithms. Our project is available at https://github.com/21210240056/SC-ImageNet.
Cite
Text
Huang et al. "Weakly Supervised Learning of Semantic Correspondence Through Cascaded Online Correspondence Refinement." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01489
Markdown
[Huang et al. "Weakly Supervised Learning of Semantic Correspondence Through Cascaded Online Correspondence Refinement." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/huang2023iccv-weakly/) doi:10.1109/ICCV51070.2023.01489
BibTeX
@inproceedings{huang2023iccv-weakly,
title = {{Weakly Supervised Learning of Semantic Correspondence Through Cascaded Online Correspondence Refinement}},
author = {Huang, Yiwen and Sun, Yixuan and Lai, Chenghang and Xu, Qing and Wang, Xiaomei and Shen, Xuli and Ge, Weifeng},
booktitle = {International Conference on Computer Vision},
year = {2023},
pages = {16254-16263},
doi = {10.1109/ICCV51070.2023.01489},
url = {https://mlanthology.org/iccv/2023/huang2023iccv-weakly/}
}