Active Model Selection: A Variance Minimization Approach

Abstract

The cost of labeling is a significant challenge in practical machine learning. This issue arises not only during the learning phase but also at the model evaluation phase, as there is a need for a substantial amount of labeled test data in addition to the training data. In this study, we address the challenge of active model selection with the goal of minimizing labeling costs for choosing the best-performing model from a set of model candidates. Based on an appropriate test loss estimator, we propose an adaptive labeling strategy that can estimate the difference of test losses with small variance, thereby enabling the estimation of the best model using fewer labeling cost. Experimental results on real-world datasets confirm that our method efficiently selects the best model.

Cite

Text

Hara et al. "Active Model Selection: A Variance Minimization Approach." Machine Learning, 2024. doi:10.1007/S10994-024-06603-1

Markdown

[Hara et al. "Active Model Selection: A Variance Minimization Approach." Machine Learning, 2024.](https://mlanthology.org/mlj/2024/hara2024mlj-active/) doi:10.1007/S10994-024-06603-1

BibTeX

@article{hara2024mlj-active,
  title     = {{Active Model Selection: A Variance Minimization Approach}},
  author    = {Hara, Satoshi and Matsuura, Mitsuru and Honda, Junya and Ito, Shinji},
  journal   = {Machine Learning},
  year      = {2024},
  pages     = {8327-8345},
  doi       = {10.1007/S10994-024-06603-1},
  volume    = {113},
  url       = {https://mlanthology.org/mlj/2024/hara2024mlj-active/}
}