Zhang, Bin Claire

2 publications

ICLR 2026 Scaling with Collapse: Efficient and Predictable Training of LLM Families Shane Bergsma, Bin Claire Zhang, Nolan Simran Dey, Shaheer Muhammad, Gurpreet Gosal, Joel Hestness
NeurIPS 2025 Don't Be Lazy: CompleteP Enables Compute-Efficient Deep Transformers Nolan Simran Dey, Bin Claire Zhang, Lorenzo Noci, Mufan Li, Blake Bordelon, Shane Bergsma, Cengiz Pehlevan, Boris Hanin, Joel Hestness