Multilingual Anchoring: Interactive Topic Modeling and Alignment Across Languages

Michelle Yuan, Benjamin Van Durme, Jordan L Ying

NeurIPS 2018 pp. 8653-8663

/neurips/2018/yuan2018neurips-multilingual/

Abstract

Multilingual topic models can reveal patterns in cross-lingual document collections. However, existing models lack speed and interactivity, which prevents adoption in everyday corpora exploration or quick moving situations (e.g., natural disasters, political instability). First, we propose a multilingual anchoring algorithm that builds an anchor-based topic model for documents in different languages. Then, we incorporate interactivity to develop MTAnchor (Multilingual Topic Anchors), a system that allows users to refine the topic model. We test our algorithms on labeled English, Chinese, and Sinhalese documents. Within minutes, our methods can produce interpretable topics that are useful for specific classification tasks.

PDF NeurIPS Semantic Scholar

Cite

Text

Yuan et al. "Multilingual Anchoring: Interactive Topic Modeling and Alignment Across Languages." Neural Information Processing Systems, 2018.

Markdown

[Yuan et al. "Multilingual Anchoring: Interactive Topic Modeling and Alignment Across Languages." Neural Information Processing Systems, 2018.](https://mlanthology.org/neurips/2018/yuan2018neurips-multilingual/)

BibTeX

@inproceedings{yuan2018neurips-multilingual,
  title     = {{Multilingual Anchoring: Interactive Topic Modeling and Alignment Across Languages}},
  author    = {Yuan, Michelle and Van Durme, Benjamin and Ying, Jordan L},
  booktitle = {Neural Information Processing Systems},
  year      = {2018},
  pages     = {8653-8663},
  url       = {https://mlanthology.org/neurips/2018/yuan2018neurips-multilingual/}
}