Zayats, Vicky

1 publications

TMLR 2025 Robust Preference Optimization Through Reward Model Distillation Adam Fisch, Jacob Eisenstein, Vicky Zayats, Alekh Agarwal, Ahmad Beirami, Chirag Nagpal, Peter Shaw, Jonathan Berant