Cross-Domain Semantic Segmentation via Domain-Invariant Interactive Relation Transfer

Abstract

Exploiting photo-realistic synthetic data to train semantic segmentation models has received increasing attention over the past years. However, the domain mismatch between synthetic and real images will cause a significant performance drop when the model trained with synthetic images is directly applied to real-world scenarios. In this paper, we propose a new domain adaptation approach, called Pivot Interaction Transfer (PIT). Our method mainly focuses on constructing pivot information that is common knowledge shared across domains as a bridge to promote the adaptation of semantic segmentation model from synthetic domains to real-world domains. Specifically, we first infer the image-level category information about the target images, which is then utilized to facilitate pixel-level transfer for semantic segmentation, with the assumption that the interactive relation between the image-level category information and the pixel-level semantic information is invariant across domains. To this end, we propose a novel multi-level region expansion mechanism that aligns both the image-level and pixel-level information. Comprehensive experiments on the adaptation from both GTAV and SYNTHIA to Cityscapes clearly demonstrate the superiority of our method.

Cite

Text

Lv et al. "Cross-Domain Semantic Segmentation via Domain-Invariant Interactive Relation Transfer." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00439

Markdown

[Lv et al. "Cross-Domain Semantic Segmentation via Domain-Invariant Interactive Relation Transfer." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/lv2020cvpr-crossdomain/) doi:10.1109/CVPR42600.2020.00439

BibTeX

@inproceedings{lv2020cvpr-crossdomain,
  title     = {{Cross-Domain Semantic Segmentation via Domain-Invariant Interactive Relation Transfer}},
  author    = {Lv, Fengmao and Liang, Tao and Chen, Xiang and Lin, Guosheng},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00439},
  url       = {https://mlanthology.org/cvpr/2020/lv2020cvpr-crossdomain/}
}