ML Anthology
Authors
Search
About
Zayats, Vicky
1 publications
TMLR
2025
Robust Preference Optimization Through Reward Model Distillation
Adam Fisch
,
Jacob Eisenstein
,
Vicky Zayats
,
Alekh Agarwal
,
Ahmad Beirami
,
Chirag Nagpal
,
Peter Shaw
,
Jonathan Berant