Sparse Gaussian Process Regression via L1 Penalization

Abstract

To handle massive data, a variety of sparse Gaussian Process (GP) methods have been proposed to reduce the computational cost. Many of them essentially map the large dataset into a small set of basis points. A common approach to learn these basis points is evidence maximization. Nevertheless, evidence maximization may lead to overfitting and cause a high computational cost. In this paper, we propose a novel sparse GP regression approach, GPLasso, that explicitly represents the trade-off between its approximation quality and the model sparsity. GPLasso minimizes a l1-penalized KL divergence between the exact and sparse GP posterior processes. Optimizing this convex cost function leads to sparse GP parameters. Furthermore, we use incomplete Cholesky factorization to obtain low-rank matrix approximations to speed up the optimization procedure. Experimental results on synthetic and real data demonstrate that, compared with several state-of-the-art sparse GP methods and a direct low-rank matrix approximation method, GPLasso achieves a significantly improved trade-off between prediction accuracy and computational cost.

Cite

Text

Yan and Qi. "Sparse Gaussian Process Regression via L1 Penalization." International Conference on Machine Learning, 2010.

Markdown

[Yan and Qi. "Sparse Gaussian Process Regression via L1 Penalization." International Conference on Machine Learning, 2010.](https://mlanthology.org/icml/2010/yan2010icml-sparse/)

BibTeX

@inproceedings{yan2010icml-sparse,
  title     = {{Sparse Gaussian Process Regression via L1 Penalization}},
  author    = {Yan, Feng and Qi, Yuan (Alan)},
  booktitle = {International Conference on Machine Learning},
  year      = {2010},
  pages     = {1183-1190},
  url       = {https://mlanthology.org/icml/2010/yan2010icml-sparse/}
}