Sample Complexity and Representation Ability of Test-Time Scaling Paradigms

Huang, Baihe; Li, Shanda; Wu, Tianhao; Yang, Yiming; Talwalkar, Ameet; Ramchandran, Kannan; Jordan, Michael I.; Jiao, Jiantao

Sample Complexity and Representation Ability of Test-Time Scaling Paradigms

Baihe Huang, Shanda Li, Tianhao Wu, Yiming Yang, Ameet Talwalkar, Kannan Ramchandran, Michael I. Jordan, Jiantao Jiao

ICLR 2026

/iclr/2026/huang2026iclr-sample/

Abstract

Test-time scaling paradigms have significantly advanced the capabilities of large language models (LLMs) on complex tasks. Despite their empirical success, theoretical understanding of the sample efficiency of various test-time strategies---such as self-consistency, best-of-$n$, and self-correction---remains limited. In this work, we first establish a separation result between two repeated sampling strategies: self-consistency requires $\Theta(1/\Delta^2)$ samples to produce the correct answer, while best-of-$n$ only needs $\Theta(1/\Delta)$, where $\Delta < 1$ denotes the probability gap between the correct and second most likely answers. Next, we present an expressiveness result for the self-correction approach with verifier feedback: it enables Transformers to simulate online learning over a pool of experts at test time. Therefore, a single Transformer architecture can provably solve multiple tasks without prior knowledge of the specific task associated with a user query, extending the representation theory of Transformers from single-task to multi-task settings. Finally, we empirically validate our theoretical results, demonstrating the practical effectiveness of self-correction methods.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Huang et al. "Sample Complexity and Representation Ability of Test-Time Scaling Paradigms." International Conference on Learning Representations, 2026.

Markdown

[Huang et al. "Sample Complexity and Representation Ability of Test-Time Scaling Paradigms." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/huang2026iclr-sample/)

BibTeX

@inproceedings{huang2026iclr-sample,
  title     = {{Sample Complexity and Representation Ability of Test-Time Scaling Paradigms}},
  author    = {Huang, Baihe and Li, Shanda and Wu, Tianhao and Yang, Yiming and Talwalkar, Ameet and Ramchandran, Kannan and Jordan, Michael I. and Jiao, Jiantao},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/huang2026iclr-sample/}
}