Approximation Algorithms for Tensor Clustering
Abstract
We present the first (to our knowledge) approximation algorithm for tensor clustering—a powerful generalization to basic 1D clustering. Tensors are increasingly common in modern applications dealing with complex heterogeneous data and clustering them is a fundamental tool for data analysis and pattern discovery. Akin to their 1D cousins, common tensor clustering formulations are NP-hard to optimize. But, unlike the 1D case, no approximation algorithms seem to be known. We address this imbalance and build on recent co-clustering work to derive a tensor clustering algorithm with approximation guarantees, allowing metrics and divergences (e.g., Bregman) as objective functions. Therewith, we answer two open questions by Anagnostopoulos et al. (2008). Our analysis yields a constant approximation factor independent of data size; a worst-case example shows this factor to be tight for Euclidean co-clustering. However, empirically the approximation factor is observed to be conservative, so our method can also be used in practice.
Cite
Text
Jegelka et al. "Approximation Algorithms for Tensor Clustering." International Conference on Algorithmic Learning Theory, 2009. doi:10.1007/978-3-642-04414-4_30Markdown
[Jegelka et al. "Approximation Algorithms for Tensor Clustering." International Conference on Algorithmic Learning Theory, 2009.](https://mlanthology.org/alt/2009/jegelka2009alt-approximation/) doi:10.1007/978-3-642-04414-4_30BibTeX
@inproceedings{jegelka2009alt-approximation,
title = {{Approximation Algorithms for Tensor Clustering}},
author = {Jegelka, Stefanie and Sra, Suvrit and Banerjee, Arindam},
booktitle = {International Conference on Algorithmic Learning Theory},
year = {2009},
pages = {368-383},
doi = {10.1007/978-3-642-04414-4_30},
url = {https://mlanthology.org/alt/2009/jegelka2009alt-approximation/}
}