Mukherjee, Debajoy

1 publications

ICLR 2025 DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback Guojun Xiong, Ujwal Dinesha, Debajoy Mukherjee, Jian Li, Srinivas Shakkottai