Credit-Budgeted ICPC-Style Coding: When Agents Must Pay for Every Decision

Abstract

As autonomous agents and agent swarms become increasingly capable at complex coding tasks, the focus must shift from mere accuracy to real-world efficiency. Unconstrained agents often waste massive resources and time, incurring high opportunity costs. To be practical, agents must manage a generalized "credit" budget covering generated tokens, local tests, and elapsed time. To evaluate this resource awareness, we introduce USACOArena, an interactive ACM-ICPC style arena where every decision consumes credits from a fixed pool. This transforms a static coding benchmark into a dynamic resource management challenge. Our experiments reveal that frontier models—including the Codex framework and leading single agents—struggle to optimally balance these economic constraints. Through competitive profiling and self-play, we uncover distinct, path-dependent decision strategies. Ultimately, enforcing a strict credit economy is essential for developing highly efficient, cost-aware agent swarms.

Cite

Text

Zhou et al. "Credit-Budgeted ICPC-Style Coding: When Agents Must Pay for Every Decision." International Conference on Learning Representations, 2026.

Markdown

[Zhou et al. "Credit-Budgeted ICPC-Style Coding: When Agents Must Pay for Every Decision." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/zhou2026iclr-creditbudgeted/)

BibTeX

@inproceedings{zhou2026iclr-creditbudgeted,
  title     = {{Credit-Budgeted ICPC-Style Coding: When Agents Must Pay for Every Decision}},
  author    = {Zhou, Lingfeng and Shi, Junhao and Gao, Jin and Wang, Dequan},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/zhou2026iclr-creditbudgeted/}
}