Decoupling Common and Unique Representations for Multimodal Self-Supervised Learning
Abstract
The increasing availability of multi-sensor data sparks wide interest in multimodal self-supervised learning. However, most existing approaches learn only common representations across modalities while ignoring intra-modal training and modality-unique representations. We propose Decoupling Common and Unique Representations (DeCUR), a simple yet effective method for multimodal self-supervised learning. By distinguishing inter- and intra-modal embeddings through multimodal redundancy reduction, DeCUR can integrate complementary information across different modalities. We evaluate DeCUR in three common multimodal scenarios (radar-optical, RGB-elevation, and RGB-depth), and demonstrate its consistent improvement regardless of architectures and for both multimodal and modality-missing settings. With thorough experiments and comprehensive analysis, we hope this work can provide valuable insights and raise more interest in researching the hidden relationships of multimodal representations1 . 1 https://github.com/zhu-xlab/DeCUR
Cite
Text
Wang et al. "Decoupling Common and Unique Representations for Multimodal Self-Supervised Learning." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73397-0_17Markdown
[Wang et al. "Decoupling Common and Unique Representations for Multimodal Self-Supervised Learning." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/wang2024eccv-decoupling/) doi:10.1007/978-3-031-73397-0_17BibTeX
@inproceedings{wang2024eccv-decoupling,
title = {{Decoupling Common and Unique Representations for Multimodal Self-Supervised Learning}},
author = {Wang, Yi and Albrecht, Conrad M and Braham, Nassim Ait Ali and Liu, Chenying and Xiong, Zhitong and Zhu, Xiao Xiang},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-73397-0_17},
url = {https://mlanthology.org/eccv/2024/wang2024eccv-decoupling/}
}