Wan, Yanming
7 publications
ICLR
2026
Learning to Summarize User Information for Personalized Reinforcement Learning from Human Feedback
NeurIPS
2024
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
7 publications