Verma, Shivanshu

2 publications

TMLR 2025 Triple Preference Optimization: Achieving Better Alignment Using a Single Step Optimization Amir Saeidi, Shivanshu Verma, Kashif Rasul, Aswin Rrv, Chitta Baral
NeurIPSW 2023 Learning Generalizable Symbolic Options for Transfer in Reinforcement Learning Rashmeet Kaur Nayyar, Shivanshu Verma, Siddharth Srivastava