Rrv, Aswin

1 publication

TMLR 2025. Triple Preference Optimization: Achieving Better Alignment Using a Single Step Optimization. Amir Saeidi, Shivanshu Verma, Kashif Rasul, Aswin Rrv, Chitta Baral.