Deep Image Clustering with Category-Style Representation

Abstract

Deep clustering which adopts deep neural networks to obtain optimal representations for clustering has been widely studied recently. In this paper, we propose a novel deep image clustering framework to learn a category-style latent representation in which the category information is disentangled from image style and can be directly used as the cluster assignment. To achieve this goal, mutual information maximization is applied to embed relevant information in the latent representation. Moreover, augmentation-invariant loss is employed to disentangle the representation into category part and style part. Last but not least, a prior distribution is imposed on the latent representation to ensure the elements of the category vector can be used as the probabilities over clusters. Comprehensive experiments demonstrate that the proposed approach outperforms state-of-the-art methods significantly on five public datasets.

Cite

Text

Zhao et al. "Deep Image Clustering with Category-Style Representation." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58568-6_4

Markdown

[Zhao et al. "Deep Image Clustering with Category-Style Representation." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/zhao2020eccv-deep/) doi:10.1007/978-3-030-58568-6_4

BibTeX

@inproceedings{zhao2020eccv-deep,
  title     = {{Deep Image Clustering with Category-Style Representation}},
  author    = {Zhao, Junjie and Lu, Donghuan and Ma, Kai and Zhang, Yu and Zheng, Yefeng},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2020},
  doi       = {10.1007/978-3-030-58568-6_4},
  url       = {https://mlanthology.org/eccv/2020/zhao2020eccv-deep/}
}