A Wikipedia Based Semantic Graph Model for Topic Tracking in Blogsphere
Abstract
There are two key issues for information diffusion in blogosphere: (1) blog posts are usually short, noisy and contain multiple themes, (2) information diffusion through blogosphere is primarily driven by the “word-of-mouth” effect, thus making topics evolve very fast. This paper presents a novel topic tracking approach to deal with these issues by modeling a topic as a semantic graph in which the semantic relatedness between terms are learned from Wikipedia. For a given topic/post, the named entities, Wikipedia concepts, and the semantic relatedness are extracted to generate the graph model. Noises are filtered out through a graph clustering algorithm. To handle topic evolution, the topic model is enriched by using Wikipedia as background knowledge. Furthermore, graph edit distance is used to measure the similarity between a topic and its posts. The proposed method is tested using real-world blog data. Experimental results show the advantage of the proposed method on tracking topics in short, noisy text.
Cite
Text
Tang et al. "A Wikipedia Based Semantic Graph Model for Topic Tracking in Blogsphere." International Joint Conference on Artificial Intelligence, 2011. doi:10.5591/978-1-57735-516-8/IJCAI11-389Markdown
[Tang et al. "A Wikipedia Based Semantic Graph Model for Topic Tracking in Blogsphere." International Joint Conference on Artificial Intelligence, 2011.](https://mlanthology.org/ijcai/2011/tang2011ijcai-wikipedia/) doi:10.5591/978-1-57735-516-8/IJCAI11-389BibTeX
@inproceedings{tang2011ijcai-wikipedia,
title = {{A Wikipedia Based Semantic Graph Model for Topic Tracking in Blogsphere}},
author = {Tang, Jintao and Wang, Ting and Lu, Qin and Wang, Ji and Li, Wenjie},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2011},
pages = {2337-2342},
doi = {10.5591/978-1-57735-516-8/IJCAI11-389},
url = {https://mlanthology.org/ijcai/2011/tang2011ijcai-wikipedia/}
}