Deb, Rohan

7 publications

ICLR 2025 Conservative Contextual Bandits: Beyond Linear Representations Rohan Deb, Mohammad Ghavamzadeh, Arindam Banerjee

ICLRW 2025 Data-Efficient Supervised Fine-Tuning of Language Models Using Optimal Design Rohan Deb, Kiran Koshy Thekumparampil, Kousha Kalantari, Gaurush Hiranandani, Shoham Sabach, Branislav Kveton

ICML 2025 FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain Rohan Deb, Kiran Koshy Thekumparampil, Kousha Kalantari, Gaurush Hiranandani, Shoham Sabach, Branislav Kveton

ICLR 2024 Contextual Bandits with Online Neural Regression Rohan Deb, Yikun Ban, Shiliang Zuo, Jingrui He, Arindam Banerjee

AISTATS 2024 Think Before You Duel: Understanding Complexities of Preference Learning Under Constrained Resources Rohan Deb, Aadirupa Saha, Arindam Banerjee

UAI 2023 Does Momentum Help in Stochastic Optimization? A Sample Complexity Analysis Swetha Ganesh, Rohan Deb, Gugan Thoppe, Amarjit Budhiraja

AAAI 2022 Gradient Temporal Difference with Momentum: Stability and Convergence Rohan Deb, Shalabh Bhatnagar