Spherical Topic Models
Abstract
We introduce the Spherical Admixture Model (SAM), a Bayesian topic model for arbitrary L2 normalized data. SAM maintains the same hierarchical structure as Latent Dirichlet Allocation (LDA), but models documents as points on a high-dimensional spherical manifold, allowing a natural likelihood parameterization in terms of cosine distance. Furthermore, SAM topics are capable of assigning negative weight to terms and can model word absence/presence unlike previous models. Performance is evaluated empirically both subjectively as a topic model using human raters and across several disparate classification tasks, from natural language processing and computer vision.
Cite
Text
Reisinger et al. "Spherical Topic Models." International Conference on Machine Learning, 2010.Markdown
[Reisinger et al. "Spherical Topic Models." International Conference on Machine Learning, 2010.](https://mlanthology.org/icml/2010/reisinger2010icml-spherical/)BibTeX
@inproceedings{reisinger2010icml-spherical,
title = {{Spherical Topic Models}},
author = {Reisinger, Joseph and Waters, Austin and Silverthorn, Bryan and Mooney, Raymond J.},
booktitle = {International Conference on Machine Learning},
year = {2010},
pages = {903-910},
url = {https://mlanthology.org/icml/2010/reisinger2010icml-spherical/}
}