Decoupling Common and Unique Representations for Multimodal Self-Supervised Learning

Abstract

The increasing availability of multi-sensor data sparks wide interest in multimodal self-supervised learning. However, most existing approaches learn only common representations across modalities while ignoring intra-modal training and modality-unique representations. We propose Decoupling Common and Unique Representations (DeCUR), a simple yet effective method for multimodal self-supervised learning. By distinguishing inter- and intra-modal embeddings through multimodal redundancy reduction, DeCUR can integrate complementary information across different modalities. We evaluate DeCUR in three common multimodal scenarios (radar-optical, RGB-elevation, and RGB-depth), and demonstrate its consistent improvement regardless of architectures and for both multimodal and modality-missing settings. With thorough experiments and comprehensive analysis, we hope this work can provide valuable insights and raise more interest in researching the hidden relationships of multimodal representations1 . 1 https://github.com/zhu-xlab/DeCUR

Cite

Text

Wang et al. "Decoupling Common and Unique Representations for Multimodal Self-Supervised Learning." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73397-0_17

Markdown

[Wang et al. "Decoupling Common and Unique Representations for Multimodal Self-Supervised Learning." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/wang2024eccv-decoupling/) doi:10.1007/978-3-031-73397-0_17

BibTeX

@inproceedings{wang2024eccv-decoupling,
  title     = {{Decoupling Common and Unique Representations for Multimodal Self-Supervised Learning}},
  author    = {Wang, Yi and Albrecht, Conrad M and Braham, Nassim Ait Ali and Liu, Chenying and Xiong, Zhitong and Zhu, Xiao Xiang},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73397-0_17},
  url       = {https://mlanthology.org/eccv/2024/wang2024eccv-decoupling/}
}