Muhammad, Shaheer

1 publications

ICLR 2026 Scaling with Collapse: Efficient and Predictable Training of LLM Families Shane Bergsma, Bin Claire Zhang, Nolan Simran Dey, Shaheer Muhammad, Gurpreet Gosal, Joel Hestness