Modeling Documents with Deep Boltzmann Machines
Abstract
We introduce a type of Deep Boltzmann Machine (DBM) that is suitable for extracting distributed semantic representations from a large unstructured collection of documents. We overcome the apparent difficulty of training a DBM with judicious parameter tying. This enables an efficient pretraining algorithm and a state initialization scheme for fast inference. The model can be trained just as efficiently as a standard Restricted Boltzmann Machine. Our experiments show that the model assigns better log probability to unseen data than the Replicated Softmax model. Features extracted from our model outperform LDA, Replicated Softmax, and DocNADE models on document retrieval and document classification tasks.
Cite
Text
Srivastava et al. "Modeling Documents with Deep Boltzmann Machines." Conference on Uncertainty in Artificial Intelligence, 2013.Markdown
[Srivastava et al. "Modeling Documents with Deep Boltzmann Machines." Conference on Uncertainty in Artificial Intelligence, 2013.](https://mlanthology.org/uai/2013/srivastava2013uai-modeling/)BibTeX
@inproceedings{srivastava2013uai-modeling,
title = {{Modeling Documents with Deep Boltzmann Machines}},
author = {Srivastava, Nitish and Salakhutdinov, Ruslan and Hinton, Geoffrey E.},
booktitle = {Conference on Uncertainty in Artificial Intelligence},
year = {2013},
url = {https://mlanthology.org/uai/2013/srivastava2013uai-modeling/}
}