Improving Rocchio with Weakly Supervised Clustering

Abstract

This paper presents a novel approach for adapting the complexity of a text categorization system to the difficulty of the task. In this study, we adapt a simple text classifier (Rocchio), using weakly supervised clustering techniques. The idea is to identify sub-topics of the original classes which can help improve the categorization process. To this end, we propose several clustering algorithms, and report results of various evaluations on standard benchmark corpora such as the Newsgroups corpus.

Cite

Text

Vinot and Yvon. "Improving Rocchio with Weakly Supervised Clustering." European Conference on Machine Learning, 2003. doi:10.1007/978-3-540-39857-8_41

Markdown

[Vinot and Yvon. "Improving Rocchio with Weakly Supervised Clustering." European Conference on Machine Learning, 2003.](https://mlanthology.org/ecmlpkdd/2003/vinot2003ecml-improving/) doi:10.1007/978-3-540-39857-8_41

BibTeX

@inproceedings{vinot2003ecml-improving,
  title     = {{Improving Rocchio with Weakly Supervised Clustering}},
  author    = {Vinot, Romain and Yvon, François},
  booktitle = {European Conference on Machine Learning},
  year      = {2003},
  pages     = {456-467},
  doi       = {10.1007/978-3-540-39857-8_41},
  url       = {https://mlanthology.org/ecmlpkdd/2003/vinot2003ecml-improving/}
}