Consistent Polyhedral Surrogates for Top-K Classification and Variants
Abstract
Top-$k$ classification is a generalization of multiclass classification used widely in information retrieval, image classification, and other extreme classification settings. Several hinge-like (piecewise-linear) surrogates have been proposed for the problem, yet all are either non-convex or inconsistent. For the proposed hinge-like surrogates that are convex (i.e., polyhedral), we apply the recent embedding framework of Finocchiaro et al. (2019; 2022) to determine the prediction problem for which the surrogate is consistent. These problems can all be interpreted as variants of top-$k$ classification, which may be better aligned with some applications. We leverage this analysis to derive constraints on the conditional label distributions under which these proposed surrogates become consistent for top-$k$. It has been further suggested that every convex hinge-like surrogate must be inconsistent for top-$k$. Yet, we use the same embedding framework to give the first consistent polyhedral surrogate for this problem.
Cite
Text
Thilagar et al. "Consistent Polyhedral Surrogates for Top-K Classification and Variants." International Conference on Machine Learning, 2022.Markdown
[Thilagar et al. "Consistent Polyhedral Surrogates for Top-K Classification and Variants." International Conference on Machine Learning, 2022.](https://mlanthology.org/icml/2022/thilagar2022icml-consistent/)BibTeX
@inproceedings{thilagar2022icml-consistent,
title = {{Consistent Polyhedral Surrogates for Top-K Classification and Variants}},
author = {Thilagar, Anish and Frongillo, Rafael and Finocchiaro, Jessica J and Goodwill, Emma},
booktitle = {International Conference on Machine Learning},
year = {2022},
pages = {21329-21359},
volume = {162},
url = {https://mlanthology.org/icml/2022/thilagar2022icml-consistent/}
}