Learning Hierarchical Multi-Category Text Classification Models

Abstract

We present a kernel-based algorithm for hierarchical text classification where the documents are allowed to belong to more than one category at a time. The classification model is a variant of the Maximum Margin Markov Network framework, where the classification hierarchy is represented as a Markov tree equipped with an exponential family defined on the edges. We present an efficient optimization algorithm based on incremental conditional gradient ascent in single-example subspaces spanned by the marginal dual variables. Experiments show that the algorithm can feasibly optimize training sets of thousands of examples and classification hierarchies consisting of hundreds of nodes. The algorithm's predictive accuracy is competitive with other recently introduced hierarchical multi-category or multilabel classification learning algorithms.

Cite

Text

Rousu et al. "Learning Hierarchical Multi-Category Text Classification Models." International Conference on Machine Learning, 2005. doi:10.1145/1102351.1102445

Markdown

[Rousu et al. "Learning Hierarchical Multi-Category Text Classification Models." International Conference on Machine Learning, 2005.](https://mlanthology.org/icml/2005/rousu2005icml-learning/) doi:10.1145/1102351.1102445

BibTeX

@inproceedings{rousu2005icml-learning,
  title     = {{Learning Hierarchical Multi-Category Text Classification Models}},
  author    = {Rousu, Juho and Saunders, Craig and Szedmák, Sándor and Shawe-Taylor, John},
  booktitle = {International Conference on Machine Learning},
  year      = {2005},
  pages     = {744-751},
  doi       = {10.1145/1102351.1102445},
  url       = {https://mlanthology.org/icml/2005/rousu2005icml-learning/}
}