Online and Batch Learning of Pseudo-Metrics

Abstract

We describe and analyze an online algorithm for supervised learning of pseudo-metrics. The algorithm receives pairs of instances and predicts their similarity according to a pseudo-metric. The pseudo-metrics we use are quadratic forms parameterized by positive semi-definite matrices. The core of the algorithm is an update rule that is based on successive projections onto the positive semi-definite cone and onto half-space constraints imposed by the examples. We describe an efficient procedure for performing these projections, derive a worst case mistake bound on the similarity predictions, and discuss a dual version of the algorithm in which it is simple to incorporate kernel operators. The online algorithm also serves as a building block for deriving a large-margin batch algorithm. We demonstrate the merits of the proposed approach by conducting experiments on MNIST dataset and on document filtering.

Cite

Text

Shalev-Shwartz et al. "Online and Batch Learning of Pseudo-Metrics." International Conference on Machine Learning, 2004. doi:10.1145/1015330.1015376

Markdown

[Shalev-Shwartz et al. "Online and Batch Learning of Pseudo-Metrics." International Conference on Machine Learning, 2004.](https://mlanthology.org/icml/2004/shalevshwartz2004icml-online/) doi:10.1145/1015330.1015376

BibTeX

@inproceedings{shalevshwartz2004icml-online,
  title     = {{Online and Batch Learning of Pseudo-Metrics}},
  author    = {Shalev-Shwartz, Shai and Singer, Yoram and Ng, Andrew Y.},
  booktitle = {International Conference on Machine Learning},
  year      = {2004},
  doi       = {10.1145/1015330.1015376},
  url       = {https://mlanthology.org/icml/2004/shalevshwartz2004icml-online/}
}