Refinery: An Open Source Topic Modeling Web Platform

Abstract

We introduce Refinery, an open source platform for exploring large text document collections with topic models. Refinery is a standalone web application driven by a graphical interface, so it is usable by those without machine learning or programming expertise. Users can interactively organize articles by topic and also refine this organization with phrase-level analysis. Under the hood, we train Bayesian nonparametric topic models that can adapt model complexity to the provided data with scalable learning algorithms. The project website contains Python code and further documentation.

Cite

Text

Kim et al. "Refinery: An Open Source Topic Modeling Web Platform." Machine Learning Open Source Software, 2017.

Markdown

[Kim et al. "Refinery: An Open Source Topic Modeling Web Platform." Machine Learning Open Source Software, 2017.](https://mlanthology.org/mloss/2017/kim2017jmlr-refinery/)

BibTeX

@article{kim2017jmlr-refinery,
  title     = {{Refinery: An Open Source Topic Modeling Web Platform}},
  author    = {Kim, Daeil and Swanson, Benjamin F. and Hughes, Michael C. and Sudderth, Erik B.},
  journal   = {Machine Learning Open Source Software},
  year      = {2017},
  pages     = {1-5},
  volume    = {18},
  url       = {https://mlanthology.org/mloss/2017/kim2017jmlr-refinery/}
}