ML Anthology
Authors
Search
About
Panigrahi, Abhishek
18 publications
ICLR
2025
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
,
Nikunj Saunshi
,
Kaifeng Lyu
,
Sobhan Miryoosefi
,
Sashank J. Reddi
,
Satyen Kale
,
Sanjiv Kumar
ICML
2025
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
Simon Park
,
Abhishek Panigrahi
,
Yun Cheng
,
Dingli Yu
,
Anirudh Goyal
,
Sanjeev Arora
ICLRW
2025
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
Simon Park
,
Abhishek Panigrahi
,
Yun Cheng
,
Dingli Yu
,
Anirudh Goyal
,
Sanjeev Arora
ICML
2025
On the Power of Context-Enhanced Learning in LLMs
Xingyu Zhu
,
Abhishek Panigrahi
,
Sanjeev Arora
ICLRW
2025
On the Power of Context-Enhanced Learning in LLMs
Xingyu Zhu
,
Abhishek Panigrahi
,
Sanjeev Arora
ICLR
2025
Progressive Distillation Induces an Implicit Curriculum
Abhishek Panigrahi
,
Bingbin Liu
,
Sadhika Malladi
,
Andrej Risteski
,
Surbhi Goel
ICMLW
2024
Progressive Distillation Improves Feature Learning via Implicit Curriculum
Abhishek Panigrahi
,
Bingbin Liu
,
Sadhika Malladi
,
Andrej Risteski
,
Surbhi Goel
ICMLW
2024
Progressive Distillation Improves Feature Learning via Implicit Curriculum
Abhishek Panigrahi
,
Bingbin Liu
,
Sadhika Malladi
,
Andrej Risteski
,
Surbhi Goel
NeurIPSW
2024
Progressive Distillation Induces an Implicit Curriculum
Abhishek Panigrahi
,
Bingbin Liu
,
Sadhika Malladi
,
Andrej Risteski
,
Surbhi Goel
ICMLW
2024
Representing Rule-Based Chatbots with Transformers
Dan Friedman
,
Abhishek Panigrahi
,
Danqi Chen
ICML
2024
Trainable Transformer in Transformer
Abhishek Panigrahi
,
Sadhika Malladi
,
Mengzhou Xia
,
Sanjeev Arora
NeurIPSW
2023
Do Transformers Parse While Predicting the Masked Word?
Haoyu Zhao
,
Abhishek Panigrahi
,
Rong Ge
,
Sanjeev Arora
ICML
2023
Task-Specific Skill Localization in Fine-Tuned Language Models
Abhishek Panigrahi
,
Nikunj Saunshi
,
Haoyu Zhao
,
Sanjeev Arora
NeurIPSW
2023
Trainable Transformer in Transformer
Abhishek Panigrahi
,
Sadhika Malladi
,
Mengzhou Xia
,
Sanjeev Arora
NeurIPS
2022
On the SDEs and Scaling Rules for Adaptive Gradient Algorithms
Sadhika Malladi
,
Kaifeng Lyu
,
Abhishek Panigrahi
,
Sanjeev Arora
ICML
2022
Understanding Gradient Descent on the Edge of Stability in Deep Learning
Sanjeev Arora
,
Zhiyuan Li
,
Abhishek Panigrahi
NeurIPS
2021
Learning and Generalization in RNNs
Abhishek Panigrahi
,
Navin Goyal
ICLR
2020
Effect of Activation Functions on the Training of Overparametrized Neural Nets
Abhishek Panigrahi
,
Abhishek Shetty
,
Navin Goyal