Malladi, Sadhika

31 publications

ICLR 2025 Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws Yiding Jiang, Allan Zhou, Zhili Feng, Sadhika Malladi, J Zico Kolter
ICLR 2025 MUSE: Machine Unlearning Six-Way Evaluation for Language Models Weijia Shi, Jaechan Lee, Yangsibo Huang, Sadhika Malladi, Jieyu Zhao, Ari Holtzman, Daogao Liu, Luke Zettlemoyer, Noah A. Smith, Chiyuan Zhang
ICML 2025 Metadata Conditioning Accelerates Language Model Pre-Training Tianyu Gao, Alexander Wettig, Luxi He, Yihe Dong, Sadhika Malladi, Danqi Chen
ICML 2025 Overtrained Language Models Are Harder to Fine-Tune Jacob Mitchell Springer, Sachin Goyal, Kaiyue Wen, Tanishq Kumar, Xiang Yue, Sadhika Malladi, Graham Neubig, Aditi Raghunathan
ICLRW 2025 Overtrained Language Models Are Harder to Fine-Tune Jacob Mitchell Springer, Sachin Goyal, Kaiyue Wen, Tanishq Kumar, Xiang Yue, Sadhika Malladi, Graham Neubig, Aditi Raghunathan
ICLR 2025 Progressive Distillation Induces an Implicit Curriculum Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel
ICLR 2025 Provable Unlearning in Topic Modeling and Downstream Tasks Stanley Wei, Sadhika Malladi, Sanjeev Arora, Amartya Sanyal
ICLR 2025 Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization Noam Razin, Sadhika Malladi, Adithya Bhaskar, Danqi Chen, Sanjeev Arora, Boris Hanin
NeurIPS 2024 CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs Zirui Wang, Mengzhou Xia, Luxi He, Howard Chen, Yitao Liu, Richard Zhu, Kaiqu Liang, Xindi Wu, Haotian Liu, Sadhika Malladi, Alexis Chevalier, Sanjeev Arora, Danqi Chen
ICML 2024 LESS: Selecting Influential Data for Targeted Instruction Tuning Mengzhou Xia, Sadhika Malladi, Suchin Gururangan, Sanjeev Arora, Danqi Chen
ICLRW 2024 LESS: Selecting Influential Data for Targeted Instruction Tuning Mengzhou Xia, Sadhika Malladi, Suchin Gururangan, Sanjeev Arora, Danqi Chen
NeurIPS 2024 Preference Learning Algorithms Do Not Learn Preference Rankings Angelica Chen, Sadhika Malladi, Lily H. Zhang, Xinyi Chen, Qiuyi Zhang, Rajesh Ranganath, Kyunghyun Cho
ICMLW 2024 Preference Learning Algorithms Do Not Learn Preference Rankings Angelica Chen, Sadhika Malladi, Lily H Zhang, Xinyi Chen, Qiuyi Zhang, Rajesh Ranganath, Kyunghyun Cho
ICMLW 2024 Preference Learning Algorithms Do Not Learn Preference Rankings Angelica Chen, Sadhika Malladi, Lily H Zhang, Xinyi Chen, Qiuyi Zhang, Rajesh Ranganath, Kyunghyun Cho
ICMLW 2024 Progressive Distillation Improves Feature Learning via Implicit Curriculum Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel
ICMLW 2024 Progressive Distillation Improves Feature Learning via Implicit Curriculum Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel
NeurIPSW 2024 Progressive Distillation Induces an Implicit Curriculum Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel
NeurIPSW 2024 Provable Unlearning in Topic Modeling and Downstream Tasks Stanley Wei, Sadhika Malladi, Sanjeev Arora, Amartya Sanyal
ICLR 2024 The Marginal Value of Momentum for Small Learning Rate SGD Runzhe Wang, Sadhika Malladi, Tianhao Wang, Kaifeng Lyu, Zhiyuan Li
ICML 2024 Trainable Transformer in Transformer Abhishek Panigrahi, Sadhika Malladi, Mengzhou Xia, Sanjeev Arora
NeurIPSW 2024 Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization Noam Razin, Sadhika Malladi, Adithya Bhaskar, Danqi Chen, Sanjeev Arora, Boris Hanin
NeurIPSW 2024 Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization Noam Razin, Sadhika Malladi, Adithya Bhaskar, Danqi Chen, Sanjeev Arora, Boris Hanin
ICML 2023 A Kernel-Based View of Language Model Fine-Tuning Sadhika Malladi, Alexander Wettig, Dingli Yu, Danqi Chen, Sanjeev Arora
ICLRW 2023 A Kernel-Based View of Language Model Fine-Tuning Sadhika Malladi, Alexander Wettig, Dingli Yu, Danqi Chen, Sanjeev Arora
NeurIPS 2023 Fine-Tuning Language Models with Just Forward Passes Sadhika Malladi, Tianyu Gao, Eshaan Nichani, Alex Damian, Jason Lee, Danqi Chen, Sanjeev Arora
ICMLW 2023 Fine-Tuning Language Models with Just Forward Passes Sadhika Malladi, Tianyu Gao, Eshaan Nichani, Jason D. Lee, Danqi Chen, Sanjeev Arora
ICMLW 2023 Fine-Tuning Language Models with Just Forward Passes Sadhika Malladi, Tianyu Gao, Eshaan Nichani, Alex Damian, Jason D. Lee, Danqi Chen, Sanjeev Arora
NeurIPSW 2023 Trainable Transformer in Transformer Abhishek Panigrahi, Sadhika Malladi, Mengzhou Xia, Sanjeev Arora
NeurIPS 2022 On the SDEs and Scaling Rules for Adaptive Gradient Algorithms Sadhika Malladi, Kaifeng Lyu, Abhishek Panigrahi, Sanjeev Arora
ICLR 2021 A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks Nikunj Saunshi, Sadhika Malladi, Sanjeev Arora
NeurIPS 2021 On the Validity of Modeling SGD with Stochastic Differential Equations (SDEs) Zhiyuan Li, Sadhika Malladi, Sanjeev Arora