Document Representation for One-Class SVM
Abstract
Previous studies have shown that one-class SVM is a rather weak learning method for text categorization problems. This paper points out that the poor performance observed before is largely due to the fact that the standard term weighting schemes are inadequate for one-class SVMs. We propose several representation modifications, and demonstrate empirically that, with the proposed document representation, the performance of one-class SVM, although trained on only small portion of positive examples, can reach up to 95% of that of two-class SVM trained on the whole labeled dataset.
Cite
Text
Wu et al. "Document Representation for One-Class SVM." European Conference on Machine Learning, 2004. doi:10.1007/978-3-540-30115-8_45Markdown
[Wu et al. "Document Representation for One-Class SVM." European Conference on Machine Learning, 2004.](https://mlanthology.org/ecmlpkdd/2004/wu2004ecml-document/) doi:10.1007/978-3-540-30115-8_45BibTeX
@inproceedings{wu2004ecml-document,
title = {{Document Representation for One-Class SVM}},
author = {Wu, Xiaoyun and Srihari, Rohini K. and Zheng, Zhaohui},
booktitle = {European Conference on Machine Learning},
year = {2004},
pages = {489-500},
doi = {10.1007/978-3-540-30115-8_45},
url = {https://mlanthology.org/ecmlpkdd/2004/wu2004ecml-document/}
}