Duvvuri, Sai Surya

7 publications

ICLR 2026 Let's (not) Just Put Things in Context: Test-Time Training for Long-Context LLMs Rachit Bansal, Aston Zhang, Rishabh Tiwari, Lovish Madaan, Sai Surya Duvvuri, Fnu Devvrit, David Brandfonbrener, David Alvarez-Melis, Prajjwal Bhargava, Mihir Kale, Samy Jelassi
ICLR 2026 The Art of Scaling Reinforcement Learning Compute for LLMs Fnu Devvrit, Lovish Madaan, Rishabh Tiwari, Rachit Bansal, Sai Surya Duvvuri, Manzil Zaheer, Inderjit S Dhillon, David Brandfonbrener, Rishabh Agarwal
ICML 2025 LASER: Attention with Exponential Transformation Sai Surya Duvvuri, Inderjit S Dhillon
ICLR 2025 LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization Jui-Nan Yen, Si Si, Zhao Meng, Felix Yu, Sai Surya Duvvuri, Inderjit S Dhillon, Cho-Jui Hsieh, Sanjiv Kumar
ICLR 2024 Combining Axes Preconditioners Through Kronecker Approximation for Deep Learning Sai Surya Duvvuri, Fnu Devvrit, Rohan Anil, Cho-Jui Hsieh, Inderjit S Dhillon
NeurIPS 2023 A Computationally Efficient Sparsified Online Newton Method Fnu Devvrit, Sai Surya Duvvuri, Rohan Anil, Vineet Gupta, Cho-Jui Hsieh, Inderjit S. Dhillon
NeurIPS 2023 Block Low-Rank Preconditioner with Shared Basis for Stochastic Optimization Jui-Nan Yen, Sai Surya Duvvuri, Inderjit S. Dhillon, Cho-Jui Hsieh