Online Inference for the Infinite Topic-Cluster Model: Storylines from Streaming Text
Abstract
We present the time-dependent topic-cluster model, a hierarchical approach for combining Latent Dirichlet Allocation and clustering via the Recurrent Chinese Restaurant Process. It inherits the advantages of both of its constituents, namely interpretability and concise representation. We show how it can be applied to streaming collections of objects such as real world feeds in a news portal. We provide details of a parallel Sequential Monte Carlo algorithm to perform inference in the resulting graphical model which scales to hundred of thousands of documents.
Cite
Text
Ahmed et al. "Online Inference for the Infinite Topic-Cluster Model: Storylines from Streaming Text." Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011.Markdown
[Ahmed et al. "Online Inference for the Infinite Topic-Cluster Model: Storylines from Streaming Text." Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011.](https://mlanthology.org/aistats/2011/ahmed2011aistats-online/)BibTeX
@inproceedings{ahmed2011aistats-online,
title = {{Online Inference for the Infinite Topic-Cluster Model: Storylines from Streaming Text}},
author = {Ahmed, Amr and Ho, Qirong and Teo, Choon Hui and Eisenstein, Jacob and Smola, Alex and Xing, Eric},
booktitle = {Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics},
year = {2011},
pages = {101-109},
volume = {15},
url = {https://mlanthology.org/aistats/2011/ahmed2011aistats-online/}
}