Sikchi, Harshit

17 publications

ICLR 2025 An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning Haoran Xu, Shuozhe Li, Harshit Sikchi, Scott Niekum, Amy Zhang
ICML 2025 Proto Successor Measure: Representing the Behavior Space of an RL Agent Siddhant Agarwal, Harshit Sikchi, Peter Stone, Amy Zhang
ICLRW 2025 RL Zero: Zero-Shot Language to Behaviors Without Any Supervision Harshit Sikchi, Siddhant Agarwal, Pranaya Jajoo, Samyak Parajuli, Caleb Chuck, Max Rudolph, Peter Stone, Amy Zhang, Scott Niekum
NeurIPS 2025 RLZero: Direct Policy Inference from Language Without In-Domain Supervision Harshit Sikchi, Siddhant Agarwal, Pranaya Jajoo, Samyak Parajuli, Caleb Chuck, Max Rudolph, Peter Stone, Amy Zhang, Scott Niekum
CoRL 2024 A Dual Approach to Imitation Learning from Observations with Offline Datasets Harshit Sikchi, Caleb Chuck, Amy Zhang, Scott Niekum
ICLR 2024 Contrastive Preference Learning: Learning from Human Feedback Without Reinforcement Learning Joey Hejna, Rafael Rafailov, Harshit Sikchi, Chelsea Finn, Scott Niekum, W. Bradley Knox, Dorsa Sadigh
ICLR 2024 Dual RL: Unification and New Methods for Reinforcement and Imitation Learning Harshit Sikchi, Qinqing Zheng, Amy Zhang, Scott Niekum
NeurIPS 2024 Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms Rafael Rafailov, Yaswanth Chittepu, Ryan Park, Harshit Sikchi, Joey Hejna, W. Bradley Knox, Chelsea Finn, Scott Niekum
ICMLW 2024 Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms Rafael Rafailov, Yaswanth Chittepu, Ryan Park, Harshit Sikchi, Joey Hejna, W. Bradley Knox, Chelsea Finn, Scott Niekum
ICLR 2024 Score Models for Offline Goal-Conditioned Reinforcement Learning Harshit Sikchi, Rohan Chitnis, Ahmed Touati, Alborz Geramifard, Amy Zhang, Scott Niekum
TMLR 2023 A Ranking Game for Imitation Learning Harshit Sikchi, Akanksha Saran, Wonjoon Goo, Scott Niekum
ICMLW 2023 A Ranking Game for Imitation Learning Harshit Sikchi, Akanksha Saran, Wonjoon Goo, Scott Niekum
ICLRW 2023 Imitation from Arbitrary Experience: A Dual Unification of Reinforcement and Imitation Learning Methods Harshit Sikchi, Amy Zhang, Scott Niekum
NeurIPSW 2023 Score-Models for Offline Goal-Conditioned Reinforcement Learning Harshit Sikchi, Rohan Chitnis, Ahmed Touati, Alborz Geramifard, Amy Zhang, Scott Niekum
NeurIPSW 2022 A Ranking Game for Imitation Learning Harshit Sikchi, Akanksha Saran, Wonjoon Goo, Scott Niekum
CoRL 2021 Learning Off-Policy with Online Planning Harshit Sikchi, Wenxuan Zhou, David Held
CoRL 2020 f-IRL: Inverse Reinforcement Learning via State Marginal Matching Tianwei Ni, Harshit Sikchi, Yufei Wang, Tejus Gupta, Lisa Lee, Ben Eysenbach