Bayesian Neural Scaling Law Extrapolation with Prior-Data Fitted Networks

Abstract

Scaling has been a major driver of recent advancements in deep learning. Numerous empirical studies have found that scaling laws often follow the power-law and proposed several variants of power-law functions to predict the scaling behavior at larger scales. However, existing methods mostly rely on point estimation and do not quantify uncertainty, which is crucial for real-world applications involving decision-making problems such as determining the expected performance improvements achievable by investing additional computational resources. In this work, we explore a Bayesian framework based on Prior-data Fitted Networks (PFNs) for neural scaling law extrapolation. Specifically, we design a prior distribution that enables the sampling of infinitely many synthetic functions resembling real-world neural scaling laws, allowing our PFN to meta-learn the extrapolation. We validate the effectiveness of our approach on real-world neural scaling laws, comparing it against both the existing point estimation methods and Bayesian approaches. Our method demonstrates superior performance, particularly in data-limited scenarios such as Bayesian active learning, underscoring its potential for reliable, uncertainty-aware extrapolation in practical applications.

Cite

Text

Lee et al. "Bayesian Neural Scaling Law Extrapolation with Prior-Data Fitted Networks." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Lee et al. "Bayesian Neural Scaling Law Extrapolation with Prior-Data Fitted Networks." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/lee2025icml-bayesian-a/)

BibTeX

@inproceedings{lee2025icml-bayesian-a,
  title     = {{Bayesian Neural Scaling Law Extrapolation with Prior-Data Fitted Networks}},
  author    = {Lee, Dongwoo and Lee, Dong Bok and Adriaensen, Steven and Lee, Juho and Hwang, Sung Ju and Hutter, Frank and Kim, Seon Joo and Lee, Hae Beom},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {33243-33275},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/lee2025icml-bayesian-a/}
}