ML Anthology
Authors
Search
About
Sharma, Archit
24 publications
ICLR
2025
Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
Sheryl Hsu
,
Omar Khattab
,
Chelsea Finn
,
Archit Sharma
ICLRW
2025
Policy-Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
,
Tian Gao
,
Georgia Gabriela Sampaio
,
Mohan Kumar Srirama
,
Archit Sharma
,
Chelsea Finn
,
Aviral Kumar
NeurIPS
2024
A Critical Evaluation of AI Feedback for Aligning Large Language Models
Archit Sharma
,
Sedrick Keh
,
Eric Mitchell
,
Chelsea Finn
,
Kushal Arora
,
Thomas Kollar
ICLR
2024
An Emulator for Fine-Tuning Large Language Models Using Small Language Models
Eric Mitchell
,
Rafael Rafailov
,
Archit Sharma
,
Chelsea Finn
,
Christopher D Manning
ICLR
2024
Language Model Detectors Are Easily Optimized Against
Charlotte Nicks
,
Eric Mitchell
,
Rafael Rafailov
,
Archit Sharma
,
Christopher D Manning
,
Chelsea Finn
,
Stefano Ermon
ICML
2024
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
Fahim Tajwar
,
Anikait Singh
,
Archit Sharma
,
Rafael Rafailov
,
Jeff Schneider
,
Tengyang Xie
,
Stefano Ermon
,
Chelsea Finn
,
Aviral Kumar
ICML
2024
RLVF: Learning from Verbal Feedback Without Overgeneralization
Moritz Pascal Stephan
,
Alexander Khazatsky
,
Eric Mitchell
,
Annie S Chen
,
Sheryl Hsu
,
Archit Sharma
,
Chelsea Finn
NeurIPSW
2023
An Emulator for Fine-Tuning Large Language Models Using Small Language Models
Eric Mitchell
,
Rafael Rafailov
,
Archit Sharma
,
Chelsea Finn
,
Christopher Manning
NeurIPS
2023
Direct Preference Optimization: Your Language Model Is Secretly a Reward Model
Rafael Rafailov
,
Archit Sharma
,
Eric Mitchell
,
Christopher D Manning
,
Stefano Ermon
,
Chelsea Finn
ICMLW
2023
Direct Preference Optimization: Your Language Model Is Secretly a Reward Model
Rafael Rafailov
,
Archit Sharma
,
Eric Mitchell
,
Stefano Ermon
,
Christopher D Manning
,
Chelsea Finn
NeurIPSW
2023
Language Model Detectors Are Easily Optimized Against
Charlotte Nicks
,
Eric Mitchell
,
Rafael Rafailov
,
Archit Sharma
,
Christopher Manning
,
Chelsea Finn
,
Stefano Ermon
CoRL
2023
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning
Archit Sharma
,
Ahmed M. Ahmed
,
Rehaan Ahmad
,
Chelsea Finn
CoRL
2023
Waypoint-Based Imitation Learning for Robotic Manipulation
Lucy Xiaoyang Shi
,
Archit Sharma
,
Tony Z. Zhao
,
Chelsea Finn
ICML
2022
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning
Archit Sharma
,
Rehaan Ahmad
,
Chelsea Finn
ICLR
2022
Autonomous Reinforcement Learning: Formalism and Benchmarking
Archit Sharma
,
Kelvin Xu
,
Nikhil Sardana
,
Abhishek Gupta
,
Karol Hausman
,
Sergey Levine
,
Chelsea Finn
NeurIPS
2022
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning
Annie Xie
,
Fahim Tajwar
,
Archit Sharma
,
Chelsea Finn
ICMLW
2022
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning
Annie Xie
,
Fahim Tajwar
,
Archit Sharma
,
Chelsea Finn
NeurIPS
2022
You Only Live Once: Single-Life Reinforcement Learning
Annie Chen
,
Archit Sharma
,
Sergey Levine
,
Chelsea Finn
ICMLW
2022
You Only Live Once: Single-Life Reinforcement Learning via Learned Reward Shaping
Annie S Chen
,
Archit Sharma
,
Sergey Levine
,
Chelsea Finn
NeurIPS
2021
Autonomous Reinforcement Learning via Subgoal Curricula
Archit Sharma
,
Abhishek Gupta
,
Sergey Levine
,
Karol Hausman
,
Chelsea Finn
NeurIPSW
2021
Discriminator Augmented Model-Based Reinforcement Learning
Behzad Haghgoo
,
Allan Zhou
,
Archit Sharma
,
Chelsea Finn
ICML
2021
Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning
Jongwook Choi
,
Archit Sharma
,
Honglak Lee
,
Sergey Levine
,
Shixiang Shane Gu
ICLR
2020
Dynamics-Aware Unsupervised Skill Discovery
Archit Sharma
,
Shixiang Gu
,
Sergey Levine
,
Vikash Kumar
,
Karol Hausman
MLJ
2019
A Flexible Probabilistic Framework for Large-Margin Mixture of Experts
Archit Sharma
,
Siddhartha Saxena
,
Piyush Rai