Layerwise Systematic Scan: Deep Boltzmann Machines and Beyond

Abstract

For Markov chain Monte Carlo methods, one of the greatest discrepancies between theory and system is the scan order - while most theoretical development on the mixing time analysis deals with random updates, real-world systems are implemented with systematic scans. We bridge this gap for models that exhibit a bipartite structure, including, most notably, the Restricted/Deep Boltzmann Machine. The de facto implementation for these models scans variables in a layerwise fashion. We show that the Gibbs sampler with a layerwise alternating scan order has its relaxation time (in terms of epochs) no larger than that of a random-update Gibbs sampler (in terms of variable updates). We also construct examples to show that this bound is asymptotically tight. Through standard inequalities, our result also implies a comparison on the mixing times.

Cite

Text

Guo et al. "Layerwise Systematic Scan: Deep Boltzmann Machines and Beyond." International Conference on Artificial Intelligence and Statistics, 2018.

Markdown

[Guo et al. "Layerwise Systematic Scan: Deep Boltzmann Machines and Beyond." International Conference on Artificial Intelligence and Statistics, 2018.](https://mlanthology.org/aistats/2018/guo2018aistats-layerwise/)

BibTeX

@inproceedings{guo2018aistats-layerwise,
  title     = {{Layerwise Systematic Scan: Deep Boltzmann Machines and Beyond}},
  author    = {Guo, Heng and Kara, Kaan and Zhang, Ce},
  booktitle = {International Conference on Artificial Intelligence and Statistics},
  year      = {2018},
  pages     = {178-187},
  url       = {https://mlanthology.org/aistats/2018/guo2018aistats-layerwise/}
}