Statistical-Computational Tradeoff in Single Index Models
Abstract
We study the statistical-computational tradeoffs in a high-dimensional single index model $Y = f(X^\top\beta^*) + \epsilon$, where $f$ is unknown, $X$ is a Gaussian vector, and $\beta^*$ is $s$-sparse with unit norm. When $\mathrm{Cov}(Y, X^\top\beta^*) \neq 0$, Plan and Vershynin (2016) show that the direction and support of $\beta^*$ can be recovered using a generalized version of the Lasso. In this paper, we investigate the case in which this critical assumption fails to hold, where the problem becomes considerably harder. Using the statistical query model to characterize the computational cost of an algorithm, we show that when $\mathrm{Cov}(Y, X^\top\beta^*) = 0$ and $\mathrm{Cov}(Y, (X^\top\beta^*)^2) > 0$, no computationally tractable algorithm can achieve the information-theoretic limit of the minimax risk. This implies that one must pay an extra computational cost for the nonlinearity involved in the model.
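To make the two covariance conditions concrete, here is a minimal simulation sketch (ours, not from the paper); it assumes the illustrative link $f(u) = u^2$, for which $\mathrm{Cov}(Y, X^\top\beta^*)$ vanishes by symmetry of the Gaussian index while $\mathrm{Cov}(Y, (X^\top\beta^*)^2)$ is strictly positive, i.e. the hard regime studied here:

# Illustrative simulation, assuming the hypothetical link f(u) = u^2.
import numpy as np

rng = np.random.default_rng(0)
n, d, s = 100_000, 50, 5           # samples, ambient dimension, sparsity

# s-sparse direction beta* with unit Euclidean norm
beta = np.zeros(d)
beta[:s] = 1.0 / np.sqrt(s)

X = rng.standard_normal((n, d))    # Gaussian design
u = X @ beta                       # index X^T beta*, distributed N(0, 1)
eps = 0.1 * rng.standard_normal(n) # independent noise
Y = u**2 + eps                     # f(u) = u^2 (link unknown to the learner)

# Empirical versions of the two covariance conditions in the abstract
cov_linear = np.cov(Y, u)[0, 1]        # ~ 0: E[u^3] = 0 for Gaussian u
cov_quadratic = np.cov(Y, u**2)[0, 1]  # > 0: Var(u^2) = 2 for u ~ N(0, 1)

print(f"Cov(Y, X^T beta*)     ~ {cov_linear:.4f}")
print(f"Cov(Y, (X^T beta*)^2) ~ {cov_quadratic:.4f}")

Running this prints a linear covariance near zero and a quadratic covariance near $2 = \mathrm{Var}(u^2)$ for a standard Gaussian index, matching the regime in which the paper's lower bound applies.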
Cite
Wang, Lingxiao, Zhuoran Yang, and Zhaoran Wang. "Statistical-Computational Tradeoff in Single Index Models." Neural Information Processing Systems, 2019.
https://mlanthology.org/neurips/2019/wang2019neurips-statisticalcomputational/

BibTeX
@inproceedings{wang2019neurips-statisticalcomputational,
title = {{Statistical-Computational Tradeoff in Single Index Models}},
author = {Wang, Lingxiao and Yang, Zhuoran and Wang, Zhaoran},
booktitle = {Neural Information Processing Systems},
year = {2019},
pages = {10419--10426},
url = {https://mlanthology.org/neurips/2019/wang2019neurips-statisticalcomputational/}
}