Asymptotic Statistical Analysis of Sparse Group LASSO via Approximate Message Passing

Abstract

Sparse Group LASSO (SGL) is a regularized model for high-dimensional linear regression problems with grouped covariates. SGL applies $l_1$ and $l_2$ penalties on the individual predictors and group predictors, respectively, to guarantee sparse effects both on the inter-group and within-group levels. In this paper, we apply the approximate message passing (AMP) algorithm to efficiently solve the SGL problem under Gaussian random designs. We further use the recently developed state evolution analysis of AMP to derive an asymptotically exact characterization of SGL solution. This allows us to conduct multiple fine-grained statistical analyses of SGL, through which we investigate the effects of the group information and $\gamma$ (proportion of $\ell_1$ penalty). With the lens of various performance measures, we show that SGL with small $\gamma$ benefits significantly from the group information and can outperform other SGL (including LASSO) or regularized models which does not exploit the group information, in terms of the recovery rate of signal, false discovery rate and mean squared error.

Cite

Text

Chen et al. "Asymptotic Statistical Analysis of Sparse Group LASSO via Approximate Message Passing." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021. doi:10.1007/978-3-030-86523-8_31

Markdown

[Chen et al. "Asymptotic Statistical Analysis of Sparse Group LASSO via Approximate Message Passing." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021.](https://mlanthology.org/ecmlpkdd/2021/chen2021ecmlpkdd-asymptotic/) doi:10.1007/978-3-030-86523-8_31

BibTeX

@inproceedings{chen2021ecmlpkdd-asymptotic,
  title     = {{Asymptotic Statistical Analysis of Sparse Group LASSO via Approximate Message Passing}},
  author    = {Chen, Kan and Bu, Zhiqi and Xu, Shiyun},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2021},
  pages     = {510-526},
  doi       = {10.1007/978-3-030-86523-8_31},
  url       = {https://mlanthology.org/ecmlpkdd/2021/chen2021ecmlpkdd-asymptotic/}
}