Learning from Sample Stability for Deep Clustering

Abstract

Deep clustering, an unsupervised technique independent of labels, necessitates tailored supervision for model training. Prior methods explore supervision like similarity and pseudo labels, yet overlook individual sample training analysis. Our study correlates sample stability during unsupervised training with clustering accuracy and network memorization on a per-sample basis. Unstable representations across epochs often lead to mispredictions, indicating difficulty in memorization and atypicality. Leveraging these findings, we introduce supervision signals for the first time based on sample stability at the representation level. Our proposed strategy serves as a versatile tool to enhance various deep clustering techniques. Experiments across benchmark datasets showcase that incorporating sample stability into training can improve the performance of deep clustering. The code is available at https://github.com/LZX-001/LFSS.

Cite

Text

Li et al. "Learning from Sample Stability for Deep Clustering." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Li et al. "Learning from Sample Stability for Deep Clustering." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/li2025icml-learning/)

BibTeX

@inproceedings{li2025icml-learning,
  title     = {{Learning from Sample Stability for Deep Clustering}},
  author    = {Li, Zhixin and Jia, Yuheng and Liu, Hui and Hou, Junhui},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {34904-34919},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/li2025icml-learning/}
}