Topic Modeling with Nonparametric Markov Tree
Abstract
A new hierarchical tree-based topic model is developed, based on nonparametric Bayesian techniques. The model has two unique attributes: (i) a child node in the tree may have more than one parent, with the goal of eliminating redundant sub-topics deep in the tree; and (ii) parsimonious sub-topics are manifested, by removing redundant usage of words at multiple scales. The depth and width of the tree are unbounded within the prior, with a retrospective sampler employed to adaptively infer the appropriate tree size based upon the corpus under study. Excellent quantitative results are manifested on five standard data sets, and the inferred tree structure is also found to be highly interpretable.
Cite
Text
Chen et al. "Topic Modeling with Nonparametric Markov Tree." International Conference on Machine Learning, 2011.Markdown
[Chen et al. "Topic Modeling with Nonparametric Markov Tree." International Conference on Machine Learning, 2011.](https://mlanthology.org/icml/2011/chen2011icml-topic/)BibTeX
@inproceedings{chen2011icml-topic,
title = {{Topic Modeling with Nonparametric Markov Tree}},
author = {Chen, Haojun and Dunson, David B. and Carin, Lawrence},
booktitle = {International Conference on Machine Learning},
year = {2011},
pages = {377-384},
url = {https://mlanthology.org/icml/2011/chen2011icml-topic/}
}