Spectral Methods for Supervised Topic Models

Abstract

Supervised topic models simultaneously model the latent topic structure of large collections of documents and a response variable associated with each document. Existing inference methods are based on either variational approximation or Monte Carlo sampling. This paper presents a novel spectral decomposition algorithm to recover the parameters of supervised latent Dirichlet allocation (sLDA) models. The Spectral-sLDA algorithm is provably correct and computationally efficient. We prove a sample complexity bound and subsequently derive a sufficient condition for the identifiability of sLDA. Thorough experiments on a diverse range of synthetic and real-world datasets verify the theory and demonstrate the practical effectiveness of the algorithm.

Cite

Text

Wang and Zhu. "Spectral Methods for Supervised Topic Models." Neural Information Processing Systems, 2014.

Markdown

[Wang and Zhu. "Spectral Methods for Supervised Topic Models." Neural Information Processing Systems, 2014.](https://mlanthology.org/neurips/2014/wang2014neurips-spectral/)

BibTeX

@inproceedings{wang2014neurips-spectral,
  title     = {{Spectral Methods for Supervised Topic Models}},
  author    = {Wang, Yining and Zhu, Jun},
  booktitle = {Neural Information Processing Systems},
  year      = {2014},
  pages     = {1511-1519},
  url       = {https://mlanthology.org/neurips/2014/wang2014neurips-spectral/}
}