A Parallel Mixture of SVMs for Very Large Scale Problems

Collobert, Ronan; Bengio, Samy; Bengio, Yoshua

doi:10.1162/089976602753633402

A Parallel Mixture of SVMs for Very Large Scale Problems

Ronan Collobert, Samy Bengio, Yoshua Bengio

NeCo 2002 pp. 1105-1114

doi:10.1162/089976602753633402 /neco/2002/collobert2002neco-parallel/

Abstract

Support vector machines (SVMs) are the state-of-the-art models for many classification problems, but they suffer from the complexity of their training algorithm, which is at least quadratic with respect to the number of examples. Hence, it is hopeless to try to solve real-life problems having more than a few hundred thousand examples with SVMs. This article proposes a new mixture of SVMs that can be easily implemented in parallel and where each SVM is trained on a small subset of the whole data set. Experiments on a large benchmark data set (Forest) yielded significant time improvement (time complexity appears empirically to locally grow linearly with the number of examples). In addition, and surprisingly, a significant improvement in generalization was observed.

PDF NeCo Semantic Scholar

Cite

Text

Collobert et al. "A Parallel Mixture of SVMs for Very Large Scale Problems." Neural Computation, 2002. doi:10.1162/089976602753633402

Markdown

[Collobert et al. "A Parallel Mixture of SVMs for Very Large Scale Problems." Neural Computation, 2002.](https://mlanthology.org/neco/2002/collobert2002neco-parallel/) doi:10.1162/089976602753633402

BibTeX

@article{collobert2002neco-parallel,
  title     = {{A Parallel Mixture of SVMs for Very Large Scale Problems}},
  author    = {Collobert, Ronan and Bengio, Samy and Bengio, Yoshua},
  journal   = {Neural Computation},
  year      = {2002},
  pages     = {1105-1114},
  doi       = {10.1162/089976602753633402},
  volume    = {14},
  url       = {https://mlanthology.org/neco/2002/collobert2002neco-parallel/}
}