Robust Regression Revisited: Acceleration and Improved Estimation Rates
Abstract
We study fast algorithms for statistical regression problems under the strong contamination model, where the goal is to approximately optimize a generalized linear model (GLM) given adversarially corrupted samples. Prior works in this line of research were based on the \emph{robust gradient descent} framework of \cite{PrasadSBR20}, a first-order method using biased gradient queries, or the \emph{Sever} framework of \cite{DiakonikolasKK019}, an iterative outlier-removal method calling a stationary point finder. We present nearly-linear time algorithms for robust regression problems with improved runtime or estimation guarantees compared to the state-of-the-art. For the general case of smooth GLMs (e.g.\ logistic regression), we show that the robust gradient descent framework of \cite{PrasadSBR20} can be \emph{accelerated}, and show our algorithm extends to optimizing the Moreau envelopes of Lipschitz GLMs (e.g.\ support vector machines), answering several open questions in the literature. For the well-studied case of robust linear regression, we present an alternative approach obtaining improved estimation rates over prior nearly-linear time algorithms. Interestingly, our algorithm starts with an identifiability proof introduced in the context of the sum-of-squares algorithm of \cite{BakshiP21}, which achieved optimal error rates while requiring large polynomial runtime and sample complexity. We reinterpret their proof within the Sever framework and obtain a dramatically faster and more sample-efficient algorithm under fewer distributional assumptions.
Cite
Text
Jambulapati et al. "Robust Regression Revisited: Acceleration and Improved Estimation Rates." Neural Information Processing Systems, 2021.Markdown
[Jambulapati et al. "Robust Regression Revisited: Acceleration and Improved Estimation Rates." Neural Information Processing Systems, 2021.](https://mlanthology.org/neurips/2021/jambulapati2021neurips-robust/)BibTeX
@inproceedings{jambulapati2021neurips-robust,
title = {{Robust Regression Revisited: Acceleration and Improved Estimation Rates}},
author = {Jambulapati, Arun and Li, Jerry and Schramm, Tselil and Tian, Kevin},
booktitle = {Neural Information Processing Systems},
year = {2021},
url = {https://mlanthology.org/neurips/2021/jambulapati2021neurips-robust/}
}