Balanced Clustering: A Uniform Model and Fast Algorithm
Abstract
Clustering is a fundamental research topic in data mining and machine learning. In addition, many specific applications demand that the clusters obtained be balanced. In this paper, we present a balanced clustering model that is to minimize the sum of squared distances to cluster centers, with uniform regularization functions to control the balance degree of the clustering results. To solve the model, we adopt the idea of the k-means method. We show that the k-means assignment step has an equivalent minimum cost flow formulation when the regularization functions are all convex. By using a novel and simple acceleration technique for the k-means and network simplex methods our model can be solved quite efficiently. Experimental results over benchmarks validate the advantage of our algorithm compared to the state-of-the-art balanced clustering algorithms. On most datasets, our algorithm runs more than 100 times faster than previous algorithms with a better solution.
Cite
Text
Lin et al. "Balanced Clustering: A Uniform Model and Fast Algorithm." International Joint Conference on Artificial Intelligence, 2019. doi:10.24963/IJCAI.2019/414Markdown
[Lin et al. "Balanced Clustering: A Uniform Model and Fast Algorithm." International Joint Conference on Artificial Intelligence, 2019.](https://mlanthology.org/ijcai/2019/lin2019ijcai-balanced/) doi:10.24963/IJCAI.2019/414BibTeX
@inproceedings{lin2019ijcai-balanced,
title = {{Balanced Clustering: A Uniform Model and Fast Algorithm}},
author = {Lin, Weibo and He, Zhu and Xiao, Mingyu},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2019},
pages = {2987-2993},
doi = {10.24963/IJCAI.2019/414},
url = {https://mlanthology.org/ijcai/2019/lin2019ijcai-balanced/}
}