Exact Subspace Clustering in Linear Time
Abstract
Subspace clustering is an important unsupervised learning problem with wide applications in computer vision and data analysis. However, the state-of-the-art methods for this problem suffer from high time complexity---quadratic or cubic in $n$ (the number of data instances). In this paper we exploit a data selection algorithm to speedup computation and the robust principal component analysis to strengthen robustness. Accordingly, we devise a scalable and robust subspace clustering method which costs time only linear in $n$. We prove theoretically that under certain mild assumptions our method solves the subspace clustering problem exactly even for grossly corrupted data. Our algorithm is based on very simple ideas, yet it is the only linear time algorithm with noiseless or noisy recovery guarantee. Finally, empirical results verify our theoretical analysis.
Cite
Text
Wang et al. "Exact Subspace Clustering in Linear Time." AAAI Conference on Artificial Intelligence, 2014. doi:10.1609/AAAI.V28I1.8963Markdown
[Wang et al. "Exact Subspace Clustering in Linear Time." AAAI Conference on Artificial Intelligence, 2014.](https://mlanthology.org/aaai/2014/wang2014aaai-exact/) doi:10.1609/AAAI.V28I1.8963BibTeX
@inproceedings{wang2014aaai-exact,
title = {{Exact Subspace Clustering in Linear Time}},
author = {Wang, Shusen and Tu, Bojun and Xu, Congfu and Zhang, Zhihua},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2014},
pages = {2113-2120},
doi = {10.1609/AAAI.V28I1.8963},
url = {https://mlanthology.org/aaai/2014/wang2014aaai-exact/}
}