Fast Label Embeddings via Randomized Linear Algebra

Abstract

Many modern multiclass and multilabel problems are characterized by increasingly large output spaces. For these problems, label embeddings have been shown to be a useful primitive that can improve computational and statistical efficiency. In this work we utilize a correspondence between rank constrained estimation and low dimensional label embeddings that uncovers a fast label embedding algorithm which works in both the multiclass and multilabel settings. The result is a randomized algorithm whose running time is exponentially faster than naive algorithms. We demonstrate our techniques on two large-scale public datasets, from the Large Scale Hierarchical Text Challenge and the Open Directory Project, where we obtain state of the art results.

Cite

Text

Mineiro and Karampatziakis. "Fast Label Embeddings via Randomized Linear Algebra." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2015. doi:10.1007/978-3-319-23528-8_3

Markdown

[Mineiro and Karampatziakis. "Fast Label Embeddings via Randomized Linear Algebra." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2015.](https://mlanthology.org/ecmlpkdd/2015/mineiro2015ecmlpkdd-fast/) doi:10.1007/978-3-319-23528-8_3

BibTeX

@inproceedings{mineiro2015ecmlpkdd-fast,
  title     = {{Fast Label Embeddings via Randomized Linear Algebra}},
  author    = {Mineiro, Paul and Karampatziakis, Nikos},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2015},
  pages     = {37-51},
  doi       = {10.1007/978-3-319-23528-8_3},
  url       = {https://mlanthology.org/ecmlpkdd/2015/mineiro2015ecmlpkdd-fast/}
}