Fast Transpose Methods for Kernel Learning on Sparse Data

Abstract

Kernel-based learning algorithms, such as Support Vector Machines (SVMs) or the Perceptron, often rely on sequential optimization where a few examples are added at each iteration. Updating the kernel matrix usually requires matrix-vector multiplications. We propose a new method based on transposition to speed up this computation on sparse data. Instead of computing dot-products over sparse feature vectors, our computation incrementally merges lists of training examples and minimizes access to the data. Caching and shrinking are also optimized for sparsity. On very large natural language tasks (tagging, translation, text classification) with sparse feature representations, a 20- to 80-fold speedup over LIBSVM is observed using the same SMO algorithm. Theory and experiments explain what type of sparsity structure is needed for this approach to work, and why its adaptation to Maxent sequential optimization is inefficient.
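
To illustrate the transposition idea described in the abstract, here is a minimal sketch, not the paper's implementation: the training set is stored as an inverted index from features to the examples containing them, and a column of the linear kernel matrix is obtained by merging the example lists of the new vector's active features rather than taking a dot-product with every sparse training vector. All names (`build_inverted_index`, `kernel_column`) are invented for this example.

```python
from collections import defaultdict

def build_inverted_index(examples):
    """examples: list of sparse vectors, each a dict {feature: value}.
    Returns {feature: [(example_id, value), ...]} -- the 'transposed' data."""
    index = defaultdict(list)
    for i, x in enumerate(examples):
        for f, v in x.items():
            index[f].append((i, v))
    return index

def kernel_column(index, n_examples, x_new):
    """Linear-kernel column k[i] = <x_i, x_new> for all training examples i.
    Only the lists of x_new's non-zero features are touched, so the cost
    depends on how many examples share features with x_new, not on n * d."""
    k = [0.0] * n_examples
    for f, v in x_new.items():
        for i, w in index.get(f, ()):
            k[i] += v * w
    return k

if __name__ == "__main__":
    X = [{0: 1.0, 3: 2.0}, {3: 1.0}, {1: 4.0, 2: 1.0}]
    idx = build_inverted_index(X)
    print(kernel_column(idx, len(X), {3: 0.5, 1: 1.0}))  # [1.0, 0.5, 4.0]
```

For kernels that are functions of the dot product (e.g. polynomial), the merged dot products can be passed through the kernel function afterwards; the speedups reported in the paper also rely on sparsity-aware caching and shrinking inside SMO, which this sketch does not show.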

Cite

Text

Haffner. "Fast Transpose Methods for Kernel Learning on Sparse Data." International Conference on Machine Learning, 2006. doi:10.1145/1143844.1143893

Markdown

[Haffner. "Fast Transpose Methods for Kernel Learning on Sparse Data." International Conference on Machine Learning, 2006.](https://mlanthology.org/icml/2006/haffner2006icml-fast/) doi:10.1145/1143844.1143893

BibTeX

@inproceedings{haffner2006icml-fast,
  title     = {{Fast Transpose Methods for Kernel Learning on Sparse Data}},
  author    = {Haffner, Patrick},
  booktitle = {International Conference on Machine Learning},
  year      = {2006},
  pages     = {385--392},
  doi       = {10.1145/1143844.1143893},
  url       = {https://mlanthology.org/icml/2006/haffner2006icml-fast/}
}