Divergence Triangle for Joint Training of Generator Model, Energy-Based Model, and Inferential Model

Abstract

This paper proposes the divergence triangle as a framework for joint training of a generator model, energy-based model and inference model. The divergence triangle is a compact and symmetric (anti-symmetric) objective function that seamlessly integrates variational learning, adversarial learning, wake-sleep algorithm, and contrastive divergence in a unified probabilistic formulation. This unification makes the processes of sampling, inference, and energy evaluation readily available without the need for costly Markov chain Monte Carlo methods. Our experiments demonstrate that the divergence triangle is capable of learning (1) an energy-based model with well-formed energy landscape, (2) direct sampling in the form of a generator network, and (3) feed-forward inference that faithfully reconstructs observed as well as synthesized data.

Cite

Text

Han et al. "Divergence Triangle for Joint Training of Generator Model, Energy-Based Model, and Inferential Model." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00887

Markdown

[Han et al. "Divergence Triangle for Joint Training of Generator Model, Energy-Based Model, and Inferential Model." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/han2019cvpr-divergence/) doi:10.1109/CVPR.2019.00887

BibTeX

@inproceedings{han2019cvpr-divergence,
  title     = {{Divergence Triangle for Joint Training of Generator Model, Energy-Based Model, and Inferential Model}},
  author    = {Han, Tian and Nijkamp, Erik and Fang, Xiaolin and Hill, Mitch and Zhu, Song-Chun and Wu, Ying Nian},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2019},
  doi       = {10.1109/CVPR.2019.00887},
  url       = {https://mlanthology.org/cvpr/2019/han2019cvpr-divergence/}
}