Agarwal, Prabhat

1 publications

TMLR 2025 Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier Anirudhan Badrinath, Prabhat Agarwal, Jiajing Xu