Stabilizing Black-Box Model Selection with the Inflated Argmax

Abstract

Model selection is the process of choosing from a class of candidate models given data. For instance, methods such as the LASSO and sparse identification of nonlinear dynamics (SINDy) formulate model selection as finding a sparse solution to a linear system of equations determined by training data. However, absent strong assumptions, such methods are highly unstable: if a single data point is removed from the training set, a different model may be selected. In this paper, we present a new approach to stabilizing model selection with theoretical stability guarantees that leverages a combination of bagging and an ''inflated'' argmax operation. Our method selects a small collection of models that all fit the data, and it is stable in that, with high probability, the removal of any training point will result in a collection of selected models that overlaps with the original collection. We illustrate this method in (a) a simulation in which strongly correlated covariates make standard LASSO model selection highly unstable, (b) a Lotka–Volterra model selection problem focused on identifying how competition in an ecosystem influences species' abundances, (c) a graph subset selection problem using cell-signaling data from proteomics, and (d) unsupervised $\kappa$-means clustering. In these settings, the proposed method yields stable, compact, and accurate collections of selected models, outperforming a variety of benchmarks.

Cite

Text

Adrian et al. "Stabilizing Black-Box Model Selection with the Inflated Argmax." Transactions on Machine Learning Research, 2025.

Markdown

[Adrian et al. "Stabilizing Black-Box Model Selection with the Inflated Argmax." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/adrian2025tmlr-stabilizing/)

BibTeX

@article{adrian2025tmlr-stabilizing,
  title     = {{Stabilizing Black-Box Model Selection with the Inflated Argmax}},
  author    = {Adrian, Melissa and Soloff, Jake A and Willett, Rebecca},
  journal   = {Transactions on Machine Learning Research},
  year      = {2025},
  url       = {https://mlanthology.org/tmlr/2025/adrian2025tmlr-stabilizing/}
}