Clustering via Concave Minimization

Paul S. Bradley, Olvi L. Mangasarian, W. Nick Street

NeurIPS 1996 pp. 368-374

/neurips/1996/bradley1996neurips-clustering/

Abstract

The problem of assigning m points in the n-dimensional real space Rn to k clusters is formulated as that of determining k centers in Rn such that the sum of distances of each point to the nearest center is minimized. If a polyhedral distance is used, the problem can be formulated as that of minimizing a piecewise-linear concave function on a polyhedral set which is shown to be equivalent to a bilinear program: minimizing a bilinear function on a polyhe(cid:173) dral set. A fast finite k-Median Algorithm consisting of solving few linear programs in closed form leads to a stationary point of the bilinear program. Computational testing on a number of real(cid:173) world databases was carried out. On the Wisconsin Diagnostic Breast Cancer (WDBC) database, k-Median training set correct(cid:173) ness was comparable to that of the k-Mean Algorithm, however its testing set correctness was better. Additionally, on the Wisconsin Prognostic Breast Cancer (WPBC) database, distinct and clini(cid:173) cally important survival curves were extracted by the k-Median Algorithm, whereas the k-Mean Algorithm failed to obtain such distinct survival curves for the same database.

PDF NeurIPS Semantic Scholar

Cite

Text

Bradley et al. "Clustering via Concave Minimization." Neural Information Processing Systems, 1996.

Markdown

[Bradley et al. "Clustering via Concave Minimization." Neural Information Processing Systems, 1996.](https://mlanthology.org/neurips/1996/bradley1996neurips-clustering/)

BibTeX

@inproceedings{bradley1996neurips-clustering,
  title     = {{Clustering via Concave Minimization}},
  author    = {Bradley, Paul S. and Mangasarian, Olvi L. and Street, W. Nick},
  booktitle = {Neural Information Processing Systems},
  year      = {1996},
  pages     = {368-374},
  url       = {https://mlanthology.org/neurips/1996/bradley1996neurips-clustering/}
}