Spectral Methods for Supervised Topic Models
Abstract
Supervised topic models simultaneously model the latent topic structure of large collections of documents and a response variable associated with each document. Existing inference methods are based on either variational approximation or Monte Carlo sampling. This paper presents a novel spectral decomposition algorithm to recover the parameters of supervised latent Dirichlet allocation (sLDA) models. The Spectral-sLDA algorithm is provably correct and computationally efficient. We prove a sample complexity bound and subsequently derive a sufficient condition for the identifiability of sLDA. Thorough experiments on a diverse range of synthetic and real-world datasets verify the theory and demonstrate the practical effectiveness of the algorithm.
Cite
Text
Wang and Zhu. "Spectral Methods for Supervised Topic Models." Neural Information Processing Systems, 2014.Markdown
[Wang and Zhu. "Spectral Methods for Supervised Topic Models." Neural Information Processing Systems, 2014.](https://mlanthology.org/neurips/2014/wang2014neurips-spectral/)BibTeX
@inproceedings{wang2014neurips-spectral,
title = {{Spectral Methods for Supervised Topic Models}},
author = {Wang, Yining and Zhu, Jun},
booktitle = {Neural Information Processing Systems},
year = {2014},
pages = {1511-1519},
url = {https://mlanthology.org/neurips/2014/wang2014neurips-spectral/}
}