Nayak, Anuj K.

1 publications

NeurIPSW 2024 An Information Theory of Compute-Optimal Size Scaling, Emergence, and Plateaus in Language Models Anuj K. Nayak, Lav R. Varshney