Progressive EM for Latent Tree Models and Hierarchical Topic Detection

Abstract

Hierarchical latent tree analysis (HLTA) is recently proposed as a new method for topic detection. It differs fundamentally from the LDA-based methods in terms of topic definition, topic-document relationship, and learning method. It has been shown to discover significantly more coherent topics and better topic hierarchies. However, HLTA relies on the Expectation-Maximization (EM) algorithm for parameter estimation and hence is not efficient enough to deal with large datasets. In this paper, we propose a method to drastically speed up HLTA using a technique inspired by the advances in the method of moments. Empirical experiments show that our method greatly improves the efficiency of HLTA. It is as efficient as the state-of-the-art LDA-based method for hierarchical topic detection and finds substantially better topics and topic hierarchies.

Cite

Text

Chen et al. "Progressive EM for Latent Tree Models and Hierarchical Topic Detection." AAAI Conference on Artificial Intelligence, 2016. doi:10.1609/AAAI.V30I1.10196

Markdown

[Chen et al. "Progressive EM for Latent Tree Models and Hierarchical Topic Detection." AAAI Conference on Artificial Intelligence, 2016.](https://mlanthology.org/aaai/2016/chen2016aaai-progressive/) doi:10.1609/AAAI.V30I1.10196

BibTeX

@inproceedings{chen2016aaai-progressive,
  title     = {{Progressive EM for Latent Tree Models and Hierarchical Topic Detection}},
  author    = {Chen, Peixian and Zhang, Nevin L. and Poon, Leonard K. M. and Chen, Zhourong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2016},
  pages     = {1498-1504},
  doi       = {10.1609/AAAI.V30I1.10196},
  url       = {https://mlanthology.org/aaai/2016/chen2016aaai-progressive/}
}