Consistent Polyhedral Surrogates for Top-K Classification and Variants

Abstract

Top-$k$ classification is a generalization of multiclass classification used widely in information retrieval, image classification, and other extreme classification settings. Several hinge-like (piecewise-linear) surrogates have been proposed for the problem, yet all are either non-convex or inconsistent. For the proposed hinge-like surrogates that are convex (i.e., polyhedral), we apply the recent embedding framework of Finocchiaro et al. (2019; 2022) to determine the prediction problem for which the surrogate is consistent. These problems can all be interpreted as variants of top-$k$ classification, which may be better aligned with some applications. We leverage this analysis to derive constraints on the conditional label distributions under which these proposed surrogates become consistent for top-$k$. It has been further suggested that every convex hinge-like surrogate must be inconsistent for top-$k$. Yet, we use the same embedding framework to give the first consistent polyhedral surrogate for this problem.

Cite

Text

Thilagar et al. "Consistent Polyhedral Surrogates for Top-K Classification and Variants." International Conference on Machine Learning, 2022.

Markdown

[Thilagar et al. "Consistent Polyhedral Surrogates for Top-K Classification and Variants." International Conference on Machine Learning, 2022.](https://mlanthology.org/icml/2022/thilagar2022icml-consistent/)

BibTeX

@inproceedings{thilagar2022icml-consistent,
  title     = {{Consistent Polyhedral Surrogates for Top-K Classification and Variants}},
  author    = {Thilagar, Anish and Frongillo, Rafael and Finocchiaro, Jessica J and Goodwill, Emma},
  booktitle = {International Conference on Machine Learning},
  year      = {2022},
  pages     = {21329-21359},
  volume    = {162},
  url       = {https://mlanthology.org/icml/2022/thilagar2022icml-consistent/}
}