Learning Sparse SVM for Feature Selection on Very High Dimensional Datasets

Abstract

A sparse representation of Support vector Machines (SVMs) with respect to input features is desirable for many applications. In this paper, by introducing a 0-1 control variable to each input feature, $l_0$-norm Sparse SVM (SSVM) is converted to a mixed integer programming (MIP) problem. Rather than directly solving this MIP, we propose an efficient cutting plane algorithm combining with multiple kernel learning to solve its convex relaxation. A global convergence proof for our method is also presented. Comprehensive experimental results on one synthetic and 10 real world datasets show that our proposed method can obtain better or competitive performance compared with existing SVM-based feature selection methods in term of sparsity and eneralization performance. Moreover, our proposed method can effectively handle large-scale and extremely high dimensional problems.

Cite

Text

Tan et al. "Learning Sparse SVM for Feature Selection on Very High Dimensional Datasets." International Conference on Machine Learning, 2010.

Markdown

[Tan et al. "Learning Sparse SVM for Feature Selection on Very High Dimensional Datasets." International Conference on Machine Learning, 2010.](https://mlanthology.org/icml/2010/tan2010icml-learning/)

BibTeX

@inproceedings{tan2010icml-learning,
  title     = {{Learning Sparse SVM for Feature Selection on Very High Dimensional Datasets}},
  author    = {Tan, Mingkui and Wang, Li and Tsang, Ivor W.},
  booktitle = {International Conference on Machine Learning},
  year      = {2010},
  pages     = {1047-1054},
  url       = {https://mlanthology.org/icml/2010/tan2010icml-learning/}
}