Rrv, Aswin

2 publications

ICLR 2026. GuidedSampling: Steering LLMs Towards Diverse Candidate Solutions at Inference-Time. Divij Handa, Mihir Parmar, Aswin Rrv, Md Nayem Uddin, Hamid Palangi, Chitta Baral.
TMLR 2025. Triple Preference Optimization: Achieving Better Alignment Using a Single Step Optimization. Amir Saeidi, Shivanshu Verma, Kashif Rasul, Aswin Rrv, Chitta Baral.