Panigrahi, Abhishek

18 publications

ICLR 2025 | Efficient Stagewise Pretraining via Progressive Subnetworks | Abhishek Panigrahi, Nikunj Saunshi, Kaifeng Lyu, Sobhan Miryoosefi, Sashank J. Reddi, Satyen Kale, Sanjiv Kumar
ICML 2025 | Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? | Simon Park, Abhishek Panigrahi, Yun Cheng, Dingli Yu, Anirudh Goyal, Sanjeev Arora
ICLRW 2025 | Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? | Simon Park, Abhishek Panigrahi, Yun Cheng, Dingli Yu, Anirudh Goyal, Sanjeev Arora
ICML 2025 | On the Power of Context-Enhanced Learning in LLMs | Xingyu Zhu, Abhishek Panigrahi, Sanjeev Arora
ICLRW 2025 | On the Power of Context-Enhanced Learning in LLMs | Xingyu Zhu, Abhishek Panigrahi, Sanjeev Arora
ICLR 2025 | Progressive Distillation Induces an Implicit Curriculum | Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel
ICMLW 2024 | Progressive Distillation Improves Feature Learning via Implicit Curriculum | Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel
ICMLW 2024 | Progressive Distillation Improves Feature Learning via Implicit Curriculum | Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel
NeurIPSW 2024 | Progressive Distillation Induces an Implicit Curriculum | Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel
ICMLW 2024 | Representing Rule-Based Chatbots with Transformers | Dan Friedman, Abhishek Panigrahi, Danqi Chen
ICML 2024 | Trainable Transformer in Transformer | Abhishek Panigrahi, Sadhika Malladi, Mengzhou Xia, Sanjeev Arora
NeurIPSW 2023 | Do Transformers Parse While Predicting the Masked Word? | Haoyu Zhao, Abhishek Panigrahi, Rong Ge, Sanjeev Arora
ICML 2023 | Task-Specific Skill Localization in Fine-Tuned Language Models | Abhishek Panigrahi, Nikunj Saunshi, Haoyu Zhao, Sanjeev Arora
NeurIPSW 2023 | Trainable Transformer in Transformer | Abhishek Panigrahi, Sadhika Malladi, Mengzhou Xia, Sanjeev Arora
NeurIPS 2022 | On the SDEs and Scaling Rules for Adaptive Gradient Algorithms | Sadhika Malladi, Kaifeng Lyu, Abhishek Panigrahi, Sanjeev Arora
ICML 2022 | Understanding Gradient Descent on the Edge of Stability in Deep Learning | Sanjeev Arora, Zhiyuan Li, Abhishek Panigrahi
NeurIPS 2021 | Learning and Generalization in RNNs | Abhishek Panigrahi, Navin Goyal
ICLR 2020 | Effect of Activation Functions on the Training of Overparametrized Neural Nets | Abhishek Panigrahi, Abhishek Shetty, Navin Goyal