Exact Subspace Clustering in Linear Time

Abstract

Subspace clustering is an important unsupervised learning problem with wide applications in computer vision and data analysis. However, the state-of-the-art methods for this problem suffer from high time complexity---quadratic or cubic in $n$ (the number of data instances). In this paper we exploit a data selection algorithm to speedup computation and the robust principal component analysis to strengthen robustness. Accordingly, we devise a scalable and robust subspace clustering method which costs time only linear in $n$. We prove theoretically that under certain mild assumptions our method solves the subspace clustering problem exactly even for grossly corrupted data. Our algorithm is based on very simple ideas, yet it is the only linear time algorithm with noiseless or noisy recovery guarantee. Finally, empirical results verify our theoretical analysis.

Cite

Text

Wang et al. "Exact Subspace Clustering in Linear Time." AAAI Conference on Artificial Intelligence, 2014. doi:10.1609/AAAI.V28I1.8963

Markdown

[Wang et al. "Exact Subspace Clustering in Linear Time." AAAI Conference on Artificial Intelligence, 2014.](https://mlanthology.org/aaai/2014/wang2014aaai-exact/) doi:10.1609/AAAI.V28I1.8963

BibTeX

@inproceedings{wang2014aaai-exact,
  title     = {{Exact Subspace Clustering in Linear Time}},
  author    = {Wang, Shusen and Tu, Bojun and Xu, Congfu and Zhang, Zhihua},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2014},
  pages     = {2113-2120},
  doi       = {10.1609/AAAI.V28I1.8963},
  url       = {https://mlanthology.org/aaai/2014/wang2014aaai-exact/}
}