Semi-Supervised Max-Margin Topic Model with Manifold Posterior Regularization
Abstract
Supervised topic models leverage label information to learn discriminative latent topic representations. As collecting a fully labeled dataset is often time-consuming, semi-supervised learning is of high interest. In this paper, we present an effective semi-supervised max-margin topic model by naturally introducing manifold posterior regularization to a regularized Bayesian topic model, named LapMedLDA. The model jointly learns latent topics and a related classifier with only a small fraction of labeled documents. To perform the approximate inference, we derive an efficient stochastic gradient MCMC method. Unlike the previous semi-supervised topic models, our model adopts a tight coupling between the generative topic model and the discriminative classifier. Extensive experiments demonstrate that such tight coupling brings significant benefits in quantitative and qualitative performance.
Cite
Text
Hu et al. "Semi-Supervised Max-Margin Topic Model with Manifold Posterior Regularization." International Joint Conference on Artificial Intelligence, 2017. doi:10.24963/IJCAI.2017/259Markdown
[Hu et al. "Semi-Supervised Max-Margin Topic Model with Manifold Posterior Regularization." International Joint Conference on Artificial Intelligence, 2017.](https://mlanthology.org/ijcai/2017/hu2017ijcai-semi/) doi:10.24963/IJCAI.2017/259BibTeX
@inproceedings{hu2017ijcai-semi,
title = {{Semi-Supervised Max-Margin Topic Model with Manifold Posterior Regularization}},
author = {Hu, Wenbo and Zhu, Jun and Su, Hang and Zhuo, Jingwei and Zhang, Bo},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2017},
pages = {1865-1871},
doi = {10.24963/IJCAI.2017/259},
url = {https://mlanthology.org/ijcai/2017/hu2017ijcai-semi/}
}