Deb, Rohan

7 publications

ICLR 2025 Conservative Contextual Bandits: Beyond Linear Representations Rohan Deb, Mohammad Ghavamzadeh, Arindam Banerjee
ICLRW 2025 Data-Efficient Supervised Fine-Tuning of Language Models Using Optimal Design Rohan Deb, Kiran Koshy Thekumparampil, Kousha Kalantari, Gaurush Hiranandani, Shoham Sabach, Branislav Kveton
ICML 2025 FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain Rohan Deb, Kiran Koshy Thekumparampil, Kousha Kalantari, Gaurush Hiranandani, Shoham Sabach, Branislav Kveton
ICLR 2024 Contextual Bandits with Online Neural Regression Rohan Deb, Yikun Ban, Shiliang Zuo, Jingrui He, Arindam Banerjee
AISTATS 2024 Think Before You Duel: Understanding Complexities of Preference Learning Under Constrained Resources Rohan Deb, Aadirupa Saha, Arindam Banerjee
UAI 2023 Does Momentum Help in Stochastic Optimization? a Sample Complexity Analysis. Swetha Ganesh, Rohan Deb, Gugan Thoppe, Amarjit Budhiraja
AAAI 2022 Gradient Temporal Difference with Momentum: Stability and Convergence Rohan Deb, Shalabh Bhatnagar