Poddar, Sriyash

2 publications

NeurIPS 2024 Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning Sriyash Poddar, Yanming Wan, Hamish Ivison, Abhishek Gupta, Natasha Jaques
NeurIPSW 2024 Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning Sriyash Poddar, Yanming Wan, Hamish Ivison, Abhishek Gupta, Natasha Jaques