Open Domain Short Text Conceptualization: A Generative + Descriptive Modeling Approach
Abstract
Concepts embody the knowledge to facilitate our cognitive processes of learning. Mapping short texts to a large set of open domain concepts has gained many successful applications. In this paper, we unify the existing conceptualization methods from a Bayesian perspective, and discuss the three modeling approaches: descriptive, generative, and discriminative models. Motivated by the discussion of their advantages and shortcomings, we develop a generative + descriptive modeling approach. Our model considers term relatedness in the context, and will result in disambiguated conceptualization. We show the results of short text clustering using a news title data set and a Twitter message data set, and demonstrate the effectiveness of the developed approach compared with the state-of-the-art conceptualization and topic modeling approaches.
Cite
Text
Song et al. "Open Domain Short Text Conceptualization: A Generative + Descriptive Modeling Approach." International Joint Conference on Artificial Intelligence, 2015.Markdown
[Song et al. "Open Domain Short Text Conceptualization: A Generative + Descriptive Modeling Approach." International Joint Conference on Artificial Intelligence, 2015.](https://mlanthology.org/ijcai/2015/song2015ijcai-open/)BibTeX
@inproceedings{song2015ijcai-open,
title = {{Open Domain Short Text Conceptualization: A Generative + Descriptive Modeling Approach}},
author = {Song, Yangqiu and Wang, Shusen and Wang, Haixun},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2015},
pages = {3820-3826},
url = {https://mlanthology.org/ijcai/2015/song2015ijcai-open/}
}