Vemuri, Sushil

2 publications

ICLR 2026 Curriculum Reinforcement Learning from Easy to Hard Tasks Improves LLM Reasoning Shubham Parashar, Shurui Gui, Xiner Li, Hongyi Ling, Sushil Vemuri, Blake Olson, Eric Li, Yu Zhang, James Caverlee, Dileep Kalathil, Shuiwang Ji
NeurIPS 2025 Robust LLM Alignment via Distributionally Robust Direct Preference Optimization Zaiyan Xu, Sushil Vemuri, Kishan Panaganti, Dileep Kalathil, Rahul Jain, Deepak Ramachandran