Biyik, Erdem

17 publications

NeurIPS 2025 Actor-Free Continuous Control via Structurally Maximizable Q-Functions Yigit Korkmaz, Urvi Bhuwania, Ayush Jain, Erdem Biyik
AAAI 2025 Efficient Robot Learning via Interaction with Humans Erdem Biyik
CoRL 2025 ReWiND: Language-Guided Rewards Teach Robot Policies Without New Demonstrations Jiahui Zhang, Yusen Luo, Abrar Anwar, Sumedh Anand Sontakke, Joseph J Lim, Jesse Thomason, Erdem Biyik, Jesse Zhang
ICML 2024 Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach for Adaptive Brain Stimulation Michelle Pan, Mariah L Schrum, Vivek Myers, Erdem Biyik, Anca Dragan
NeurIPS 2024 DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning Anthony Liang, Guy Tennenholtz, Chih-Wei Hsu, Yinlam Chow, Erdem Biyik, Craig Boutilier
ICMLW 2024 DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning Anthony Liang, Guy Tennenholtz, ChihWei Hsu, Yinlam Chow, Erdem Biyik, Craig Boutilier
CoRL 2024 EXTRACT: Efficient Policy Learning by Extracting Transferable Robot Skills from Offline Data Jesse Zhang, Minho Heo, Zuxin Liu, Erdem Biyik, Joseph J Lim, Yao Liu, Rasool Fakoor
ICMLW 2024 In-Context Generalization to New Tasks from Unlabeled Observation Data Anthony Liang, Pavel Czempin, Yutai Zhou, Stephen Tu, Erdem Biyik
ICML 2024 RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback Yufei Wang, Zhanyi Sun, Jesse Zhang, Zhou Xian, Erdem Biyik, David Held, Zackory Erickson
CoRL 2024 Trajectory Improvement and Reward Learning from Comparative Language Feedback Zhaojing Yang, Miru Jun, Jeremy Tien, Stuart Russell, Anca Dragan, Erdem Biyik
TMLR 2023 Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomek Korbak, David Lindner, Pedro Freire, Tony Tong Wang, Samuel Marks, Charbel-Raphael Segerie, Micah Carroll, Andi Peng, Phillip J.K. Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Biyik, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell
NeurIPS 2022 Assistive Teaching of Motor Control Tasks to Humans Megha Srivastava, Erdem Biyik, Suvir Mirchandani, Noah Goodman, Dorsa Sadigh
AAAI 2022 Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams Erdem Biyik, Anusha Lalitha, Rajarshi Saha, Andrea Goldsmith, Dorsa Sadigh
IJCAI 2021 Emergent Prosociality in Multi-Agent Games Through Gifting Woodrow Z. Wang, Mark Beliaev, Erdem Biyik, Daniel A. Lazar, Ramtin Pedarsani, Dorsa Sadigh
CoRL 2021 Learning Multimodal Rewards from Rankings Vivek Myers, Erdem Biyik, Nima Anari, Dorsa Sadigh
CoRL 2021 Learning Reward Functions from Scale Feedback Nils Wilde, Erdem Biyik, Dorsa Sadigh, Stephen L. Smith
CoRL 2018 Batch Active Preference-Based Learning of Reward Functions Erdem Biyik, Dorsa Sadigh