Learning Semantic Segmentation from Multiple Datasets with Label Shifts

Abstract

While it is desirable to train segmentation models on an aggregation of multiple datasets, a major challenge is that the label space of each dataset may be in conflict with one another. To tackle this challenge, we propose UniSeg, an effective and model-agnostic approach to automatically train segmentation models across multiple datasets with heterogeneous label spaces, without requiring any manual relabeling efforts. Specifically, we introduce two new ideas that account for conflicting and co-occurring labels to achieve better generalization performance in unseen domains. First, we identify a gradient conflict in training incurred by mismatched label spaces and propose a class-independent binary cross-entropy loss to alleviate such label conflicts. Second, we propose a loss function that considers class-relationships across datasets for a better multi-dataset training scheme.xtensive quantitative and qualitative analyses on road-scene datasets show that UniSeg improves over multi-dataset baselines, especially on unseen datasets, e.g., achieving more than 8%p gain in IoU on KITTI. Furthermore, UniSeg achieves 39.4% IoU on the WildDash2 public benchmark, making it one of the strongest submissions in the zero-shot setting. Our project page is available at https://www.nec-labs.com/ mas/UniSeg.

Cite

Text

Kim et al. "Learning Semantic Segmentation from Multiple Datasets with Label Shifts." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19815-1_2

Markdown

[Kim et al. "Learning Semantic Segmentation from Multiple Datasets with Label Shifts." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/kim2022eccv-learning/) doi:10.1007/978-3-031-19815-1_2

BibTeX

@inproceedings{kim2022eccv-learning,
  title     = {{Learning Semantic Segmentation from Multiple Datasets with Label Shifts}},
  author    = {Kim, Dongwan and Tsai, Yi-Hsuan and Suh, Yumin and Faraki, Masoud and Garg, Sparsh and Chandraker, Manmohan and Han, Bohyung},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19815-1_2},
  url       = {https://mlanthology.org/eccv/2022/kim2022eccv-learning/}
}