Document Neural Autoregressive Distribution Estimation

Abstract

We present an approach based on feed-forward neural networks for learning the distribution over textual documents. This approach is inspired by the Neural Autoregressive Distribution Estimator (NADE) model which has been shown to be a good estimator of the distribution over discrete-valued high-dimensional vectors. In this paper, we present how NADE can successfully be adapted to textual data, retaining the property that sampling or computing the probability of an observation can be done exactly and efficiently. The approach can also be used to learn deep representations of documents that are competitive to those learned by alternative topic modeling approaches. Finally, we describe how the approach can be combined with a regular neural network N-gram model and substantially improve its performance, by making its learned representation sensitive to the larger, document-level context.

Cite

Text

Lauly et al. "Document Neural Autoregressive Distribution Estimation." Journal of Machine Learning Research, 2017.

Markdown

[Lauly et al. "Document Neural Autoregressive Distribution Estimation." Journal of Machine Learning Research, 2017.](https://mlanthology.org/jmlr/2017/lauly2017jmlr-document/)

BibTeX

@article{lauly2017jmlr-document,
  title     = {{Document Neural Autoregressive Distribution Estimation}},
  author    = {Lauly, Stanislas and Zheng, Yin and Allauzen, Alexandre and Larochelle, Hugo},
  journal   = {Journal of Machine Learning Research},
  year      = {2017},
  pages     = {1-24},
  volume    = {18},
  url       = {https://mlanthology.org/jmlr/2017/lauly2017jmlr-document/}
}