Mitchell, Eric

22 publications

NeurIPS 2024 A Critical Evaluation of AI Feedback for Aligning Large Language Models Archit Sharma, Sedrick Keh, Eric Mitchell, Chelsea Finn, Kushal Arora, Thomas Kollar

ICLR 2024 An Emulator for Fine-Tuning Large Language Models Using Small Language Models Eric Mitchell, Rafael Rafailov, Archit Sharma, Chelsea Finn, Christopher D Manning

ICLRW 2024 Calibrating Language Models with Adaptive Temperature Scaling Johnathan Xie, Annie S Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn

ICLR 2024 Fine-Tuning Language Models for Factuality Katherine Tian, Eric Mitchell, Huaxiu Yao, Christopher D Manning, Chelsea Finn

ICLR 2024 Language Model Detectors Are Easily Optimized Against Charlotte Nicks, Eric Mitchell, Rafael Rafailov, Archit Sharma, Christopher D Manning, Chelsea Finn, Stefano Ermon

NeurIPS 2024 Online Adaptation of Language Models with a Memory of Amortized Contexts Jihoon Tack, Jaehyung Kim, Eric Mitchell, Jinwoo Shin, Yee Whye Teh, Jonathan Richard Schwarz

ICML 2024 RLVF: Learning from Verbal Feedback Without Overgeneralization Moritz Pascal Stephan, Alexander Khazatsky, Eric Mitchell, Annie S Chen, Sheryl Hsu, Archit Sharma, Chelsea Finn

NeurIPSW 2023 An Emulator for Fine-Tuning Large Language Models Using Small Language Models Eric Mitchell, Rafael Rafailov, Archit Sharma, Chelsea Finn, Christopher Manning

ICML 2023 DetectGPT: Zero-Shot Machine-Generated Text Detection Using Probability Curvature Eric Mitchell, Yoonho Lee, Alexander Khazatsky, Christopher D Manning, Chelsea Finn

NeurIPS 2023 Direct Preference Optimization: Your Language Model Is Secretly a Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Christopher D Manning, Stefano Ermon, Chelsea Finn

ICMLW 2023 Direct Preference Optimization: Your Language Model Is Secretly a Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D Manning, Chelsea Finn

NeurIPSW 2023 Fine-Tuning Language Models for Factuality Katherine Tian, Eric Mitchell, Huaxiu Yao, Christopher Manning, Chelsea Finn

NeurIPSW 2023 Language Model Detectors Are Easily Optimized Against Charlotte Nicks, Eric Mitchell, Rafael Rafailov, Archit Sharma, Christopher Manning, Chelsea Finn, Stefano Ermon

NeurIPS 2023 RECKONING: Reasoning Through Dynamic Knowledge Encoding Zeming Chen, Gail Weiss, Eric Mitchell, Asli Celikyilmaz, Antoine Bosselut

ICLR 2022 Fast Model Editing at Scale Eric Mitchell, Charles Lin, Antoine Bosselut, Chelsea Finn, Christopher D Manning

ICML 2022 Memory-Based Model Editing at Scale Eric Mitchell, Charles Lin, Antoine Bosselut, Christopher D Manning, Chelsea Finn

ICMLW 2022 Self-Destructing Models: Increasing the Costs of Harmful Dual Uses in Foundation Models Eric Mitchell, Peter Henderson, Christopher D Manning, Dan Jurafsky, Chelsea Finn

CoRL 2021 Learning Language-Conditioned Robot Behavior from Offline Data and Crowd-Sourced Annotation Suraj Nair, Eric Mitchell, Kevin Chen, Brian Ichter, Silvio Savarese, Chelsea Finn

ICML 2021 Offline Meta-Reinforcement Learning with Advantage Weighting Eric Mitchell, Rafael Rafailov, Xue Bin Peng, Sergey Levine, Chelsea Finn

ICLR 2020 Higher-Order Function Networks for Learning Composable 3D Object Representations Eric Mitchell, Selim Engin, Volkan Isler, Daniel D Lee

IJCAI 2020 Reward Prediction Error as an Exploration Objective in Deep RL Riley Simmons-Edler, Ben Eisner, Daniel Yang, Anthony Bisulco, Eric Mitchell, H Sebastian Seung, Daniel D Lee

ICMLW 2019 Q-Learning for Continuous Actions with Cross-Entropy Guided Policies Riley Simmons-Edler, Ben Eisner, Eric Mitchell, Sebastian Seung, Daniel Lee