Document Representation for One-Class SVM

Abstract

Previous studies have shown that one-class SVM is a rather weak learning method for text categorization problems. This paper points out that the poor performance observed before is largely due to the fact that the standard term weighting schemes are inadequate for one-class SVMs. We propose several representation modifications, and demonstrate empirically that, with the proposed document representation, the performance of one-class SVM, although trained on only small portion of positive examples, can reach up to 95% of that of two-class SVM trained on the whole labeled dataset.

Cite

Text

Wu et al. "Document Representation for One-Class SVM." European Conference on Machine Learning, 2004. doi:10.1007/978-3-540-30115-8_45

Markdown

[Wu et al. "Document Representation for One-Class SVM." European Conference on Machine Learning, 2004.](https://mlanthology.org/ecmlpkdd/2004/wu2004ecml-document/) doi:10.1007/978-3-540-30115-8_45

BibTeX

@inproceedings{wu2004ecml-document,
  title     = {{Document Representation for One-Class SVM}},
  author    = {Wu, Xiaoyun and Srihari, Rohini K. and Zheng, Zhaohui},
  booktitle = {European Conference on Machine Learning},
  year      = {2004},
  pages     = {489-500},
  doi       = {10.1007/978-3-540-30115-8_45},
  url       = {https://mlanthology.org/ecmlpkdd/2004/wu2004ecml-document/}
}