Hsu, Sheryl

3 publications

ICLR 2026 FSPO: Few-Shot Optimization of Synthetic Preferences Effectively Personalizes to Real Users Anikait Singh, Sheryl Hsu, Kyle Hsu, Eric Mitchell, Stefano Ermon, Tatsunori Hashimoto, Archit Sharma, Chelsea Finn
ICLR 2025 Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval Sheryl Hsu, Omar Khattab, Chelsea Finn, Archit Sharma
ICML 2024 RLVF: Learning from Verbal Feedback Without Overgeneralization Moritz Pascal Stephan, Alexander Khazatsky, Eric Mitchell, Annie S Chen, Sheryl Hsu, Archit Sharma, Chelsea Finn