Learning Shrinks the Hard Tail: Training‑Dependent Inference Scaling in a Solvable Linear Model

Levi, Noam Itzhak

Learning Shrinks the Hard Tail: Training‑Dependent Inference Scaling in a Solvable Linear Model

ICLR 2026

/iclr/2026/levi2026iclr-learning/

Abstract

We analyze neural scaling laws in a solvable model of last-layer fine-tuning where targets have intrinsic, instance-heterogeneous difficulty. In our Latent Instance Difficulty (LID) model, each input's target variance is governed by a latent ''precision'' drawn from a heavy-tailed distribution. While generalization loss recovers standard scaling laws, our main contribution connects this to inference. The pass@$k$ failure rate exhibits a power-law decay, $k^{-\beta_\mathrm{eff}}$, but the observed exponent $\beta_\mathrm{eff}$ is training-dependent. It grows with sample size $N$ before saturating at an intrinsic limit $\beta$ set by the difficulty distribution's tail. This coupling reveals that learning shrinks the ''hard tail'' of the error distribution: improvements in the model's generalization error steepen the pass@$k$ curve until irreducible target variance dominates. The LID model yields testable, closed-form predictions for this behavior, including a compute-allocation rule that favors training before saturation and inference attempts after. We validate these predictions in simulations and in two real‑data proxies: CIFAR‑10H (human‑label variance) and a maths teacher–student distillation task.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Levi. "Learning Shrinks the Hard Tail: Training‑Dependent Inference Scaling in a Solvable Linear Model." International Conference on Learning Representations, 2026.

Markdown

[Levi. "Learning Shrinks the Hard Tail: Training‑Dependent Inference Scaling in a Solvable Linear Model." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/levi2026iclr-learning/)

BibTeX

@inproceedings{levi2026iclr-learning,
  title     = {{Learning Shrinks the Hard Tail: Training‑Dependent Inference Scaling in a Solvable Linear Model}},
  author    = {Levi, Noam Itzhak},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/levi2026iclr-learning/}
}