Du, Yuqing

12 publications

ICML 2024 Learning to Model the World with Language Jessy Lin, Yuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca Dragan
ICMLW 2024 Teaching Large Language Models to Reason with Reinforcement Learning Alexander Havrilla, Yuqing Du, Sharath Chandra Raparthy, Christoforos Nalmpantis, Jane Dwivedi-Yu, Eric Hambro, Sainbayar Sukhbaatar, Roberta Raileanu
NeurIPSW 2023 A Study on Improving Reasoning in Language Models Yuqing Du, Alexander Havrilla, Sainbayar Sukhbaatar, Pieter Abbeel, Roberta Raileanu
NeurIPS 2023 DPOK: Reinforcement Learning for Fine-Tuning Text-to-Image Diffusion Models Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee
ICML 2023 Guiding Pretraining in Reinforcement Learning with Large Language Models Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas
CoLLAs 2023 Vision-Language Models as Success Detectors Yuqing Du, Ksenia Konyushkova, Misha Denil, Akhil Raju, Jessica Landon, Felix Hill, Nando de Freitas, Serkan Cabi
NeurIPSW 2023 What Can AI Learn from Human Exploration? Intrinsically-Motivated Humans and Agents in Open-World Exploration Yuqing Du, Eliza Kosoy, Alyssa Li Dayan, Maria Rufova, Alison Gopnik, Pieter Abbeel
NeurIPSW 2023 What Can AI Learn from Human Exploration? Intrinsically-Motivated Humans and Agents in Open-World Exploration Yuqing Du, Eliza Kosoy, Alyssa Dayan, Maria Rufova, Pieter Abbeel, Alison Gopnik
NeurIPSW 2023 What Can AI Learn from Human Exploration? Intrinsically-Motivated Humans and Agents in Open-World Exploration Yuqing Du, Eliza Kosoy, Alyssa Dayan, Maria Rufova, Pieter Abbeel, Alison Gopnik
ICML 2022 Bayesian Imitation Learning for End-to-End Mobile Manipulation Yuqing Du, Daniel Ho, Alex Alemi, Eric Jang, Mohi Khansari
ICLR 2022 It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum Generation Yuqing Du, Pieter Abbeel, Aditya Grover
NeurIPS 2020 AvE: Assistance via Empowerment Yuqing Du, Stas Tiomkin, Emre Kiciman, Daniel Polani, Pieter Abbeel, Anca Dragan