Relating Romanized Comments to News Articles by Inferring Multi-Glyphic Topical Correspondence
Abstract
Commenting is a popular facility provided by news sites, and analyzing such user-generated content has recently attracted research interest. However, in multilingual societies such as India, this analysis is hard for two reasons. (1) There are more than 20 official languages, but linguistic resources are available mainly for Hindi. People frequently write in romanized text because it is quick and easy to type on an English keyboard, resulting in multi-glyphic comments, where the texts are in the same language but in different scripts. Such romanized texts are almost unexplored in machine learning so far. (2) In many cases, comments are made on a specific part of the article rather than on the topic of the entire article. Off-the-shelf methods such as correspondence LDA are insufficient to model such relationships between articles and comments. In this paper, we extend the notion of correspondence to model multi-lingual, multi-script, and inter-lingual topics in a unified probabilistic model called the Multi-glyphic Correspondence Topic Model (MCTM). Using several metrics, we verify our approach and show that it improves over the state-of-the-art.
Cite
Text
Tholpadi et al. "Relating Romanized Comments to News Articles by Inferring Multi-Glyphic Topical Correspondence." AAAI Conference on Artificial Intelligence, 2015. doi:10.1609/AAAI.V29I1.9173
Markdown
[Tholpadi et al. "Relating Romanized Comments to News Articles by Inferring Multi-Glyphic Topical Correspondence." AAAI Conference on Artificial Intelligence, 2015.](https://mlanthology.org/aaai/2015/tholpadi2015aaai-relating/) doi:10.1609/AAAI.V29I1.9173
BibTeX
@inproceedings{tholpadi2015aaai-relating,
title = {{Relating Romanized Comments to News Articles by Inferring Multi-Glyphic Topical Correspondence}},
author = {Tholpadi, Goutham and Das, Mrinal Kanti and Bansal, Trapit and Bhattacharyya, Chiranjib},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2015},
pages = {311--317},
doi = {10.1609/AAAI.V29I1.9173},
url = {https://mlanthology.org/aaai/2015/tholpadi2015aaai-relating/}
}