Mitchell, Eric

22 publications

NeurIPS 2024 A Critical Evaluation of AI Feedback for Aligning Large Language Models Archit Sharma, Sedrick Keh, Eric Mitchell, Chelsea Finn, Kushal Arora, Thomas Kollar
ICLR 2024 An Emulator for Fine-Tuning Large Language Models Using Small Language Models Eric Mitchell, Rafael Rafailov, Archit Sharma, Chelsea Finn, Christopher D Manning
ICLRW 2024 Calibrating Language Models with Adaptive Temperature Scaling Johnathan Xie, Annie S Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn
ICLR 2024 Fine-Tuning Language Models for Factuality Katherine Tian, Eric Mitchell, Huaxiu Yao, Christopher D Manning, Chelsea Finn
ICLR 2024 Language Model Detectors Are Easily Optimized Against Charlotte Nicks, Eric Mitchell, Rafael Rafailov, Archit Sharma, Christopher D Manning, Chelsea Finn, Stefano Ermon
NeurIPS 2024 Online Adaptation of Language Models with a Memory of Amortized Contexts Jihoon Tack, Jaehyung Kim, Eric Mitchell, Jinwoo Shin, Yee Whye Teh, Jonathan Richard Schwarz
ICML 2024 RLVF: Learning from Verbal Feedback Without Overgeneralization Moritz Pascal Stephan, Alexander Khazatsky, Eric Mitchell, Annie S Chen, Sheryl Hsu, Archit Sharma, Chelsea Finn
NeurIPSW 2023 An Emulator for Fine-Tuning Large Language Models Using Small Language Models Eric Mitchell, Rafael Rafailov, Archit Sharma, Chelsea Finn, Christopher Manning
ICML 2023 DetectGPT: Zero-Shot Machine-Generated Text Detection Using Probability Curvature Eric Mitchell, Yoonho Lee, Alexander Khazatsky, Christopher D Manning, Chelsea Finn
NeurIPS 2023 Direct Preference Optimization: Your Language Model Is Secretly a Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Christopher D Manning, Stefano Ermon, Chelsea Finn
ICMLW 2023 Direct Preference Optimization: Your Language Model Is Secretly a Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D Manning, Chelsea Finn
NeurIPSW 2023 Fine-Tuning Language Models for Factuality Katherine Tian, Eric Mitchell, Huaxiu Yao, Christopher Manning, Chelsea Finn
NeurIPSW 2023 Language Model Detectors Are Easily Optimized Against Charlotte Nicks, Eric Mitchell, Rafael Rafailov, Archit Sharma, Christopher Manning, Chelsea Finn, Stefano Ermon
NeurIPS 2023 RECKONING: Reasoning Through Dynamic Knowledge Encoding Zeming Chen, Gail Weiss, Eric Mitchell, Asli Celikyilmaz, Antoine Bosselut
ICLR 2022 Fast Model Editing at Scale Eric Mitchell, Charles Lin, Antoine Bosselut, Chelsea Finn, Christopher D Manning
ICML 2022 Memory-Based Model Editing at Scale Eric Mitchell, Charles Lin, Antoine Bosselut, Christopher D Manning, Chelsea Finn
ICMLW 2022 Self-Destructing Models: Increasing the Costs of Harmful Dual Uses in Foundation Models Eric Mitchell, Peter Henderson, Christopher D Manning, Dan Jurafsky, Chelsea Finn
CoRL 2021 Learning Language-Conditioned Robot Behavior from Offline Data and Crowd-Sourced Annotation Suraj Nair, Eric Mitchell, Kevin Chen, Brian Ichter, Silvio Savarese, Chelsea Finn
ICML 2021 Offline Meta-Reinforcement Learning with Advantage Weighting Eric Mitchell, Rafael Rafailov, Xue Bin Peng, Sergey Levine, Chelsea Finn
ICLR 2020 Higher-Order Function Networks for Learning Composable 3D Object Representations Eric Mitchell, Selim Engin, Volkan Isler, Daniel D Lee
IJCAI 2020 Reward Prediction Error as an Exploration Objective in Deep RL Riley Simmons-Edler, Ben Eisner, Daniel Yang, Anthony Bisulco, Eric Mitchell, H Sebastian Seung, Daniel D Lee
ICMLW 2019 Q-Learning for Continuous Actions with Cross-Entropy Guided Policies Riley Simmons-Edler, Ben Eisner, Eric Mitchell, Sebastian Seung, Daniel Lee