A Framework for Incorporating General Domain Knowledge into Latent Dirichlet Allocation Using First-Order Logic
Abstract
Topic models have been used successfully for a variety of problems, often in the form of application-specific extensions of the basic Latent Dirichlet Allocation (LDA) model. Because deriving these new models in order to encode domain knowledge can be difficult and time-consuming, we propose the Fold·all model, which allows the user to specify general domain knowledge in First-Order Logic (FOL). However, combining topic modeling with FOL can result in inference problems beyond the capabilities of existing techniques. We have therefore developed a scalable inference technique using stochastic gradient descent which may also be useful to the Markov Logic Network (MLN) research community. Experiments demonstrate the expresive power of Fold·all, as well as the scalability of our proposed inference method.
Cite
Text
Andrzejewski et al. "A Framework for Incorporating General Domain Knowledge into Latent Dirichlet Allocation Using First-Order Logic." International Joint Conference on Artificial Intelligence, 2011. doi:10.5591/978-1-57735-516-8/IJCAI11-200Markdown
[Andrzejewski et al. "A Framework for Incorporating General Domain Knowledge into Latent Dirichlet Allocation Using First-Order Logic." International Joint Conference on Artificial Intelligence, 2011.](https://mlanthology.org/ijcai/2011/andrzejewski2011ijcai-framework/) doi:10.5591/978-1-57735-516-8/IJCAI11-200BibTeX
@inproceedings{andrzejewski2011ijcai-framework,
title = {{A Framework for Incorporating General Domain Knowledge into Latent Dirichlet Allocation Using First-Order Logic}},
author = {Andrzejewski, David and Zhu, Xiaojin and Craven, Mark and Recht, Benjamin},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2011},
pages = {1171-1177},
doi = {10.5591/978-1-57735-516-8/IJCAI11-200},
url = {https://mlanthology.org/ijcai/2011/andrzejewski2011ijcai-framework/}
}