Evaluating Feature Selection for SVMs in High Dimensions

Abstract

We perform a systematic evaluation of feature selection (FS) methods for support vector machines (SVMs) using simulated high- dimensional data (up to 5000 dimensions). Several findings previously reported at low dimensions do not apply in high dimensions. For example, none of the FS methods investigated improved SVM accuracy, indicating that the SVM built-in regularization is sufficient. These results were also validated using microarray data. Moreover, all FS methods tend to discard many relevant features. This is a problem for applications such as microarray data analysis, where identifying all biologically important features is a major objective.

Cite

Text

Nilsson et al. "Evaluating Feature Selection for SVMs in High Dimensions." European Conference on Machine Learning, 2006. doi:10.1007/11871842_72

Markdown

[Nilsson et al. "Evaluating Feature Selection for SVMs in High Dimensions." European Conference on Machine Learning, 2006.](https://mlanthology.org/ecmlpkdd/2006/nilsson2006ecml-evaluating/) doi:10.1007/11871842_72

BibTeX

@inproceedings{nilsson2006ecml-evaluating,
  title     = {{Evaluating Feature Selection for SVMs in High Dimensions}},
  author    = {Nilsson, Roland and Peña, José M. and Björkegren, Johan and Tegnér, Jesper},
  booktitle = {European Conference on Machine Learning},
  year      = {2006},
  pages     = {719-726},
  doi       = {10.1007/11871842_72},
  url       = {https://mlanthology.org/ecmlpkdd/2006/nilsson2006ecml-evaluating/}
}