Concept Labeling: Building Text Classifiers with Minimal Supervision

Abstract

The rapid construction of supervised text classification models is becoming a pervasive need across many modern applications. To reduce human-labeling bottlenecks, many new statistical paradigms (e.g., active, semi-supervised, transfer and multi-task learning) have been vigorously pursued in recent literature with varying degrees of empirical success. Concurrently, the emergence of Web 2.0 platforms in the last decade has enabled a world-wide, collaborative human effort to construct a massive ontology of concepts with very rich, detailed and accurate descriptions. In this paper we propose a new framework to extract supervisory information from such ontologies and complement it with a shift in human effort from direct labeling of examples in the domain of interest to the much more efficient identification of concept-class associations. Through empirical studies on text categorization problems using the Wikipedia ontology, we show that this shift allows very high-quality models to be immediately induced at virtually no cost.

Cite

Text

Chenthamarakshan et al. "Concept Labeling: Building Text Classifiers with Minimal Supervision." International Joint Conference on Artificial Intelligence, 2011. doi:10.5591/978-1-57735-516-8/IJCAI11-208

Markdown

[Chenthamarakshan et al. "Concept Labeling: Building Text Classifiers with Minimal Supervision." International Joint Conference on Artificial Intelligence, 2011.](https://mlanthology.org/ijcai/2011/chenthamarakshan2011ijcai-concept/) doi:10.5591/978-1-57735-516-8/IJCAI11-208

BibTeX

@inproceedings{chenthamarakshan2011ijcai-concept,
  title     = {{Concept Labeling: Building Text Classifiers with Minimal Supervision}},
  author    = {Chenthamarakshan, Vijil and Melville, Prem and Sindhwani, Vikas and Lawrence, Richard D.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2011},
  pages     = {1225-1230},
  doi       = {10.5591/978-1-57735-516-8/IJCAI11-208},
  url       = {https://mlanthology.org/ijcai/2011/chenthamarakshan2011ijcai-concept/}
}