Modeling Documents with Deep Boltzmann Machines

Nitish Srivastava, Ruslan Salakhutdinov, Geoffrey E. Hinton

UAI 2013

/uai/2013/srivastava2013uai-modeling/

Abstract

We introduce a type of Deep Boltzmann Machine (DBM) that is suitable for extracting distributed semantic representations from a large unstructured collection of documents. We overcome the apparent difficulty of training a DBM with judicious parameter tying. This enables an efficient pretraining algorithm and a state initialization scheme for fast inference. The model can be trained just as efficiently as a standard Restricted Boltzmann Machine. Our experiments show that the model assigns better log probability to unseen data than the Replicated Softmax model. Features extracted from our model outperform LDA, Replicated Softmax, and DocNADE models on document retrieval and document classification tasks.

UAI Semantic Scholar

Cite

Text

Srivastava et al. "Modeling Documents with Deep Boltzmann Machines." Conference on Uncertainty in Artificial Intelligence, 2013.

Markdown

[Srivastava et al. "Modeling Documents with Deep Boltzmann Machines." Conference on Uncertainty in Artificial Intelligence, 2013.](https://mlanthology.org/uai/2013/srivastava2013uai-modeling/)

BibTeX

@inproceedings{srivastava2013uai-modeling,
  title     = {{Modeling Documents with Deep Boltzmann Machines}},
  author    = {Srivastava, Nitish and Salakhutdinov, Ruslan and Hinton, Geoffrey E.},
  booktitle = {Conference on Uncertainty in Artificial Intelligence},
  year      = {2013},
  url       = {https://mlanthology.org/uai/2013/srivastava2013uai-modeling/}
}