Nichani, Eshaan

16 publications

NeurIPS 2025 Emergence and Scaling Laws in SGD Learning of Shallow Neural Networks Yunwei Ren, Eshaan Nichani, Denny Wu, Jason D. Lee
COLT 2025 Learning Compositional Functions with Transformers from Easy-to-Hard Data Zixuan Wang, Eshaan Nichani, Alberto Bietti, Alex Damian, Daniel Hsu, Jason D Lee, Denny Wu
ICLR 2025 Learning Hierarchical Polynomials of Multiple Nonlinear Features Hengyu Fu, Zihao Wang, Eshaan Nichani, Jason D. Lee
ICLR 2025 Understanding Factual Recall in Transformers via Associative Memories Eshaan Nichani, Jason D. Lee, Alberto Bietti
ICML 2024 How Transformers Learn Causal Structure with Gradient Descent Eshaan Nichani, Alex Damian, Jason D. Lee
ICLR 2024 Learning Hierarchical Polynomials with Three-Layer Neural Networks Zihao Wang, Eshaan Nichani, Jason D. Lee
NeurIPSW 2024 Understanding Factual Recall in Transformers via Associative Memories Eshaan Nichani, Jason D. Lee, Alberto Bietti
NeurIPS 2023 Fine-Tuning Language Models with Just Forward Passes Sadhika Malladi, Tianyu Gao, Eshaan Nichani, Alex Damian, Jason Lee, Danqi Chen, Sanjeev Arora
ICMLW 2023 Fine-Tuning Language Models with Just Forward Passes Sadhika Malladi, Tianyu Gao, Eshaan Nichani, Jason D. Lee, Danqi Chen, Sanjeev Arora
ICMLW 2023 Fine-Tuning Language Models with Just Forward Passes Sadhika Malladi, Tianyu Gao, Eshaan Nichani, Alex Damian, Jason D. Lee, Danqi Chen, Sanjeev Arora
NeurIPS 2023 Provable Guarantees for Nonlinear Feature Learning in Three-Layer Neural Networks Eshaan Nichani, Alex Damian, Jason Lee
ICLR 2023 Self-Stabilization: The Implicit Bias of Gradient Descent at the Edge of Stability Alex Damian, Eshaan Nichani, Jason D. Lee
NeurIPS 2023 Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models Alex Damian, Eshaan Nichani, Rong Ge, Jason Lee
CLeaR 2022 Causal Structure Discovery Between Clusters of Nodes Induced by Latent Factors Chandler Squires, Annie Yun, Eshaan Nichani, Raj Agrawal, Caroline Uhler
NeurIPS 2022 Identifying Good Directions to Escape the NTK Regime and Efficiently Learn Low-Degree Plus Sparse Polynomials Eshaan Nichani, Yu Bai, Jason Lee
NeurIPSW 2022 Self-Stabilization: The Implicit Bias of Gradient Descent at the Edge of Stability Alex Damian, Eshaan Nichani, Jason D. Lee