Constructing Informative Priors Using Transfer Learning

Abstract

Many applications of supervised learning require good generalization from limited labeled data. In the Bayesian setting, we can try to achieve this goal by using an informative prior over the parameters, one that encodes useful domain knowledge. Focusing on logistic regression, we present an algorithm for automatically constructing a multivariate Gaussian prior with a full covariance matrix for a given supervised learning task. This prior relaxes a commonly used but overly simplistic independence assumption, and allows parameters to be dependent. The algorithm uses other "similar" learning problems to estimate the covariance of pairs of individual parameters. We then use a semidefinite program to combine these estimates and learn a good prior for the current learning task. We apply our methods to binary text classification, and demonstrate a 20 to 40% test error reduction over a commonly used prior.

Cite

Text

Raina et al. "Constructing Informative Priors Using Transfer Learning." International Conference on Machine Learning, 2006. doi:10.1145/1143844.1143934

Markdown

[Raina et al. "Constructing Informative Priors Using Transfer Learning." International Conference on Machine Learning, 2006.](https://mlanthology.org/icml/2006/raina2006icml-constructing/) doi:10.1145/1143844.1143934

BibTeX

@inproceedings{raina2006icml-constructing,
  title     = {{Constructing Informative Priors Using Transfer Learning}},
  author    = {Raina, Rajat and Ng, Andrew Y. and Koller, Daphne},
  booktitle = {International Conference on Machine Learning},
  year      = {2006},
  pages     = {713-720},
  doi       = {10.1145/1143844.1143934},
  url       = {https://mlanthology.org/icml/2006/raina2006icml-constructing/}
}