Weakly Supervised Learning of Semantic Correspondence Through Cascaded Online Correspondence Refinement

Abstract

In this paper, we develop a weakly supervised learning algorithm to learn robust semantic correspondences from large-scale datasets with only image-level labels. Following the spirit of multiple instance learning (MIL), we decompose the weakly supervised correspondence learning problem into three stages: image-level matching, region-level matching, and pixel-level matching. We propose a novel cascaded online correspondence refinement algorithm to integrate MIL and the correspondence filtering and refinement procedure into a single deep network and train this network end-to-end with only image-level supervision, i.e., without point-to-point matching information. During the correspondence learning process, pixel-to-pixel matching pairs inferred from weak supervision are propagated, filtered, and enhanced through masked correspondence voting and calibration. Besides, we design a correspondence consistency check algorithm to select images with discriminative key points to generate pseudo-labels for classical matching algorithms. Finally, we filter out about 110,000 images from the ImageNet ILSVRC training set to formulate a new dataset, called SC-ImageNet. Experiments on several popular benchmarks indicate that pre-training on SC-ImageNet can improve the performance of state-of-the-art algorithms efficiently. Our project is available on https://github.com/21210240056/SC-ImageNet.

Cite

Text

Huang et al. "Weakly Supervised Learning of Semantic Correspondence Through Cascaded Online Correspondence Refinement." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01489

Markdown

[Huang et al. "Weakly Supervised Learning of Semantic Correspondence Through Cascaded Online Correspondence Refinement." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/huang2023iccv-weakly/) doi:10.1109/ICCV51070.2023.01489

BibTeX

@inproceedings{huang2023iccv-weakly,
  title     = {{Weakly Supervised Learning of Semantic Correspondence Through Cascaded Online Correspondence Refinement}},
  author    = {Huang, Yiwen and Sun, Yixuan and Lai, Chenghang and Xu, Qing and Wang, Xiaomei and Shen, Xuli and Ge, Weifeng},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {16254-16263},
  doi       = {10.1109/ICCV51070.2023.01489},
  url       = {https://mlanthology.org/iccv/2023/huang2023iccv-weakly/}
}