Tight Nonparametric Convergence Rates for Stochastic Gradient Descent Under the Noiseless Linear Model
Abstract
In the context of statistical supervised learning, the noiseless linear model assumes that there exists a deterministic linear relation $Y = \langle \theta_*, \Phi(U) \rangle$ between the random output $Y$ and the random feature vector $\Phi(U)$, a potentially non-linear transformation of the inputs~$U$. We analyze the convergence of single-pass, fixed step-size stochastic gradient descent on the least-square risk under this model. The convergence of the iterates to the optimum $\theta_*$ and the decay of the generalization error follow polynomial convergence rates with exponents that both depend on the regularities of the optimum $\theta_*$ and of the feature vectors $\Phi(U)$. We interpret our result in the reproducing kernel Hilbert space framework. As a special case, we analyze an online algorithm for estimating a real function on the unit hypercube from the noiseless observation of its value at randomly sampled points; the convergence depends on the Sobolev smoothness of the function and of a chosen kernel. Finally, we apply our analysis beyond the supervised learning setting to obtain convergence rates for the averaging process (a.k.a. gossip algorithm) on a graph depending on its spectral dimension.
Cite
Text
Berthier et al. "Tight Nonparametric Convergence Rates for Stochastic Gradient Descent Under the Noiseless Linear Model." Neural Information Processing Systems, 2020.Markdown
[Berthier et al. "Tight Nonparametric Convergence Rates for Stochastic Gradient Descent Under the Noiseless Linear Model." Neural Information Processing Systems, 2020.](https://mlanthology.org/neurips/2020/berthier2020neurips-tight/)BibTeX
@inproceedings{berthier2020neurips-tight,
title = {{Tight Nonparametric Convergence Rates for Stochastic Gradient Descent Under the Noiseless Linear Model}},
author = {Berthier, Raphaël and Bach, Francis R. and Gaillard, Pierre},
booktitle = {Neural Information Processing Systems},
year = {2020},
url = {https://mlanthology.org/neurips/2020/berthier2020neurips-tight/}
}