Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

Abstract

Recently, different machine learning methods have been introduced to tackle the challenging few-shot learning scenario that is, learning from a small labeled dataset related to a specific task. Common approaches have taken the form of meta-learning: learning to learn on the new problem given the old. Following the recognition that meta-learning is implementing learning in a multi-level model, we present a Bayesian treatment for the meta-learning inner loop through the use of deep kernels. As a result we can learn a kernel that transfers to new tasks; we call this Deep Kernel Transfer (DKT). This approach has many advantages: is straightforward to implement as a single optimizer, provides uncertainty quantification, and does not require estimation of task-specific parameters. We empirically demonstrate that DKT outperforms several state-of-the-art algorithms in few-shot classification, and is the state of the art for cross-domain adaptation and regression. We conclude that complex meta-learning routines can be replaced by a simpler Bayesian model without loss of accuracy.

Cite

Text

Patacchiola et al. "Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels." Neural Information Processing Systems, 2020.

Markdown

[Patacchiola et al. "Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels." Neural Information Processing Systems, 2020.](https://mlanthology.org/neurips/2020/patacchiola2020neurips-bayesian/)

BibTeX

@inproceedings{patacchiola2020neurips-bayesian,
  title     = {{Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels}},
  author    = {Patacchiola, Massimiliano and Turner, Jack and Crowley, Elliot J. and O'Boyle, Michael and Storkey, Amos J.},
  booktitle = {Neural Information Processing Systems},
  year      = {2020},
  url       = {https://mlanthology.org/neurips/2020/patacchiola2020neurips-bayesian/}
}