Nichani, Eshaan

16 publications

NeurIPS 2025 Emergence and Scaling Laws in SGD Learning of Shallow Neural Networks Yunwei Ren, Eshaan Nichani, Denny Wu, Jason D. Lee

COLT 2025 Learning Compositional Functions with Transformers from Easy-to-Hard Data Zixuan Wang, Eshaan Nichani, Alberto Bietti, Alex Damian, Daniel Hsu, Jason D Lee, Denny Wu

ICLR 2025 Learning Hierarchical Polynomials of Multiple Nonlinear Features Hengyu Fu, Zihao Wang, Eshaan Nichani, Jason D. Lee

ICLR 2025 Understanding Factual Recall in Transformers via Associative Memories Eshaan Nichani, Jason D. Lee, Alberto Bietti

ICML 2024 How Transformers Learn Causal Structure with Gradient Descent Eshaan Nichani, Alex Damian, Jason D. Lee

ICLR 2024 Learning Hierarchical Polynomials with Three-Layer Neural Networks Zihao Wang, Eshaan Nichani, Jason D. Lee

NeurIPSW 2024 Understanding Factual Recall in Transformers via Associative Memories Eshaan Nichani, Jason D. Lee, Alberto Bietti

NeurIPS 2023 Fine-Tuning Language Models with Just Forward Passes Sadhika Malladi, Tianyu Gao, Eshaan Nichani, Alex Damian, Jason Lee, Danqi Chen, Sanjeev Arora

ICMLW 2023 Fine-Tuning Language Models with Just Forward Passes Sadhika Malladi, Tianyu Gao, Eshaan Nichani, Jason D. Lee, Danqi Chen, Sanjeev Arora

ICMLW 2023 Fine-Tuning Language Models with Just Forward Passes Sadhika Malladi, Tianyu Gao, Eshaan Nichani, Alex Damian, Jason D. Lee, Danqi Chen, Sanjeev Arora

NeurIPS 2023 Provable Guarantees for Nonlinear Feature Learning in Three-Layer Neural Networks Eshaan Nichani, Alex Damian, Jason Lee

ICLR 2023 Self-Stabilization: The Implicit Bias of Gradient Descent at the Edge of Stability Alex Damian, Eshaan Nichani, Jason D. Lee

NeurIPS 2023 Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models Alex Damian, Eshaan Nichani, Rong Ge, Jason Lee

CLeaR 2022 Causal Structure Discovery Between Clusters of Nodes Induced by Latent Factors Chandler Squires, Annie Yun, Eshaan Nichani, Raj Agrawal, Caroline Uhler

NeurIPS 2022 Identifying Good Directions to Escape the NTK Regime and Efficiently Learn Low-Degree Plus Sparse Polynomials Eshaan Nichani, Yu Bai, Jason Lee

NeurIPSW 2022 Self-Stabilization: The Implicit Bias of Gradient Descent at the Edge of Stability Alex Damian, Eshaan Nichani, Jason D. Lee