Reconsidering Representation Alignment for Multi-View Clustering
Abstract
Aligning distributions of view representations is a core component of today's state-of-the-art models for deep multi-view clustering. However, we identify several drawbacks with naively aligning representation distributions. We demonstrate that these drawbacks both lead to less separable clusters in the representation space and inhibit the model's ability to prioritize views. Based on these observations, we develop a simple baseline model for deep multi-view clustering. Our baseline model avoids representation alignment altogether, while performing similarly to, or better than, the current state of the art. We also expand our baseline model by adding a contrastive learning component. This introduces a selective alignment procedure that preserves the model's ability to prioritize views. Our experiments show that the contrastive learning component enhances the baseline model, improving on the current state of the art by a large margin on several datasets.
Cite
Text
Trosten et al. "Reconsidering Representation Alignment for Multi-View Clustering." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00131
Markdown
[Trosten et al. "Reconsidering Representation Alignment for Multi-View Clustering." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/trosten2021cvpr-reconsidering/) doi:10.1109/CVPR46437.2021.00131
BibTeX
@inproceedings{trosten2021cvpr-reconsidering,
title = {{Reconsidering Representation Alignment for Multi-View Clustering}},
author = {Trosten, Daniel J. and Lokse, Sigurd and Jenssen, Robert and Kampffmeyer, Michael},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2021},
pages = {1255--1265},
doi = {10.1109/CVPR46437.2021.00131},
url = {https://mlanthology.org/cvpr/2021/trosten2021cvpr-reconsidering/}
}