Large-Margin Convex Polytope Machine

Abstract

We present the Convex Polytope Machine (CPM), a novel non-linear learning algorithm for large-scale binary classification tasks. The CPM finds a large margin convex polytope separator which encloses one class. We develop a stochastic gradient descent based algorithm that is amenable to massive datasets, and augment it with a heuristic procedure to avoid sub-optimal local minima. Our experimental evaluations of the CPM on large-scale datasets from distinct domains (MNIST handwritten digit recognition, text topic, and web security) demonstrate that the CPM trains models faster, sometimes several orders of magnitude, than state-of-the-art similar approaches and kernel-SVM methods while achieving comparable or better classification performance. Our empirical results suggest that, unlike prior similar approaches, we do not need to control the number of sub-classifiers (sides of the polytope) to avoid overfitting.

Cite

Text

Kantchelian et al. "Large-Margin Convex Polytope Machine." Neural Information Processing Systems, 2014.

Markdown

[Kantchelian et al. "Large-Margin Convex Polytope Machine." Neural Information Processing Systems, 2014.](https://mlanthology.org/neurips/2014/kantchelian2014neurips-largemargin/)

BibTeX

@inproceedings{kantchelian2014neurips-largemargin,
  title     = {{Large-Margin Convex Polytope Machine}},
  author    = {Kantchelian, Alex and Tschantz, Michael C and Huang, Ling and Bartlett, Peter L and Joseph, Anthony D and Tygar, J. D.},
  booktitle = {Neural Information Processing Systems},
  year      = {2014},
  pages     = {3248-3256},
  url       = {https://mlanthology.org/neurips/2014/kantchelian2014neurips-largemargin/}
}