Multiple-Prediction-Powered Inference

Abstract

Statistical estimation often involves tradeoffs between expensive, high-quality measurements and a variety of lower-quality proxies. We introduce Multiple-Prediction-Powered Inference (MultiPPI): a general framework for constructing statistically efficient estimates by optimally allocating resources across these diverse data sources. This work provides theoretical guarantees about the minimax optimality, finite-sample performance, and asymptotic normality of the MultiPPI estimator, and through experiments across three diverse large language model (LLM) evaluation scenarios, we show that MultiPPI consistently achieves lower estimation error than existing baselines. This advantage stems from its budget-adaptive allocation strategy, which strategically combines subsets of models by learning their complex cost and correlation structures.

Cite

Text

Cowen-Breen et al. "Multiple-Prediction-Powered Inference." International Conference on Learning Representations, 2026.

Markdown

[Cowen-Breen et al. "Multiple-Prediction-Powered Inference." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/cowenbreen2026iclr-multiplepredictionpowered/)

BibTeX

@inproceedings{cowenbreen2026iclr-multiplepredictionpowered,
  title     = {{Multiple-Prediction-Powered Inference}},
  author    = {Cowen-Breen, Charlie and Agarwal, Alekh and Bates, Stephen and Cohen, William W. and Eisenstein, Jacob and Globerson, Amir and Fisch, Adam},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/cowenbreen2026iclr-multiplepredictionpowered/}
}