Srikant, R
26 publications
ICML
2024
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
AISTATS
2023
Learning While Scheduling in Multi-Server Systems with Unknown Statistics: MaxWeight with Discounted UCB
AISTATS
2023
On the Convergence of Policy Iteration-Based Reinforcement Learning with Monte Carlo Policy Evaluation