Improving Rocchio with Weakly Supervised Clustering
Abstract
This paper presents a novel approach for adapting the complexity of a text categorization system to the difficulty of the task. In this study, we adapt a simple text classifier (Rocchio), using weakly supervised clustering techniques. The idea is to identify sub-topics of the original classes which can help improve the categorization process. To this end, we propose several clustering algorithms, and report results of various evaluations on standard benchmark corpora such as the Newsgroups corpus.
Cite
Text
Vinot and Yvon. "Improving Rocchio with Weakly Supervised Clustering." European Conference on Machine Learning, 2003. doi:10.1007/978-3-540-39857-8_41Markdown
[Vinot and Yvon. "Improving Rocchio with Weakly Supervised Clustering." European Conference on Machine Learning, 2003.](https://mlanthology.org/ecmlpkdd/2003/vinot2003ecml-improving/) doi:10.1007/978-3-540-39857-8_41BibTeX
@inproceedings{vinot2003ecml-improving,
title = {{Improving Rocchio with Weakly Supervised Clustering}},
author = {Vinot, Romain and Yvon, François},
booktitle = {European Conference on Machine Learning},
year = {2003},
pages = {456-467},
doi = {10.1007/978-3-540-39857-8_41},
url = {https://mlanthology.org/ecmlpkdd/2003/vinot2003ecml-improving/}
}