Theobald, Barry-John
9 publications
AAAI
2024
Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It's Complicated
NeurIPSW
2024
Dueling in the Dark: An Efficient and Optimal Mirror Descent Approach for Online Optimization with Adversarial Preferences