Variational Continual Bayesian Meta-Learning
Abstract
Conventional meta-learning considers a set of tasks drawn from a stationary distribution. In contrast, this paper focuses on a more complex online setting in which tasks arrive sequentially and follow a non-stationary distribution. Accordingly, we propose a Variational Continual Bayesian Meta-Learning (VC-BML) algorithm. VC-BML maintains a Dynamic Gaussian Mixture Model over the meta-parameters, with the number of mixture components determined by a Chinese Restaurant Process. Placing a dynamic mixture at the meta-parameter level enlarges the parameter space and thereby increases the capability to adapt to diverse tasks, alleviating the negative knowledge transfer problem. To infer the posteriors of model parameters, instead of the previously used point estimation method, we develop a more robust posterior approximation method, structured variational inference, which helps avoid forgetting previously acquired knowledge. Experiments on tasks from non-stationary distributions show that VC-BML is superior at transferring knowledge among diverse tasks and at alleviating catastrophic forgetting in the online setting.
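The abstract does not spell out the sampling rule, but the Chinese Restaurant Process it refers to is standard: an arriving task joins an existing component k with probability proportional to the number of tasks already assigned to it, and opens a new component with probability proportional to a concentration parameter alpha. Below is a minimal, illustrative Python sketch of this assignment step, where each component would notionally correspond to one Gaussian over meta-parameters; the function name `crp_assign` and the toy loop are our own for illustration, not the paper's implementation.

```python
import numpy as np

def crp_assign(counts, alpha, rng):
    """Sample a component assignment from a Chinese Restaurant Process.

    counts : list of task counts per existing component.
    alpha  : concentration parameter; larger values favour new components.
    Returns the chosen component index (len(counts) means a new component).
    """
    n = sum(counts)
    # P(join existing k) = counts[k] / (n + alpha);
    # P(open new component) = alpha / (n + alpha).
    probs = np.array(counts + [alpha], dtype=float) / (n + alpha)
    return rng.choice(len(probs), p=probs)

# Toy run: assign 10 sequentially arriving tasks to components.
rng = np.random.default_rng(0)
counts = []
for t in range(10):
    k = crp_assign(counts, alpha=1.0, rng=rng)
    if k == len(counts):
        counts.append(1)   # instantiate a new component for this task
    else:
        counts[k] += 1     # reuse an existing component
print(counts)  # tasks per component; the exact split depends on the seed
```

In a dynamic mixture like VC-BML's, a step of this kind decides whether a new task reuses an existing Gaussian over meta-parameters or spawns a new one, which is what lets the number of components grow with task diversity.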
Cite

Text:
Zhang et al. "Variational Continual Bayesian Meta-Learning." Neural Information Processing Systems, 2021.

Markdown:
[Zhang et al. "Variational Continual Bayesian Meta-Learning." Neural Information Processing Systems, 2021.](https://mlanthology.org/neurips/2021/zhang2021neurips-variational/)

BibTeX:
@inproceedings{zhang2021neurips-variational,
title = {{Variational Continual Bayesian Meta-Learning}},
author = {Zhang, Qiang and Fang, Jinyuan and Meng, Zaiqiao and Liang, Shangsong and Yilmaz, Emine},
booktitle = {Neural Information Processing Systems},
year = {2021},
url = {https://mlanthology.org/neurips/2021/zhang2021neurips-variational/}
}