Hsu, Sheryl

3 publications

ICLR 2026. FSPO: Few-Shot Optimization of Synthetic Preferences Effectively Personalizes to Real Users. Anikait Singh, Sheryl Hsu, Kyle Hsu, Eric Mitchell, Stefano Ermon, Tatsunori Hashimoto, Archit Sharma, Chelsea Finn

ICLR 2025. Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval. Sheryl Hsu, Omar Khattab, Chelsea Finn, Archit Sharma

ICML 2024. RLVF: Learning from Verbal Feedback Without Overgeneralization. Moritz Pascal Stephan, Alexander Khazatsky, Eric Mitchell, Annie S Chen, Sheryl Hsu, Archit Sharma, Chelsea Finn