Lewis, Richard L

18 publications

NeurIPS 2023 Combining Behaviors with the Successor Features Keyboard Wilka Carvalho, Andre Saraiva, Angelos Filos, Andrew Lampinen, Loic Matthey, Richard L Lewis, Honglak Lee, Satinder P. Singh, Danilo Jimenez Rezende, Daniel Zoran
NeurIPS 2023 Large Language Models Can Implement Policy Iteration Ethan Brooks, Logan Walls, Richard L Lewis, Satinder P. Singh
AAAI 2022 Adaptive Pairwise Weights for Temporal Credit Assignment Zeyu Zheng, Risto Vuorio, Richard L. Lewis, Satinder Singh
NeurIPS 2021 Learning State Representations from Random Deep Action-Conditional Predictions Zeyu Zheng, Vivek Veeriah, Risto Vuorio, Richard L Lewis, Satinder P. Singh
IJCAI 2021 Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-Person Simulated 3D Environment Wilka Carvalho, Anthony Liang, Kimin Lee, Sungryull Sohn, Honglak Lee, Richard L. Lewis, Satinder Singh
ICML 2021 Reinforcement Learning of Implicit and Explicit Control Flow Instructions Ethan Brooks, Janarthanan Rajendran, Richard L Lewis, Satinder Singh
AAAI 2020 How Should an Agent Practice? Janarthanan Rajendran, Richard L. Lewis, Vivek Veeriah, Honglak Lee, Satinder Singh
NeurIPS 2019 Discovery of Useful Questions as Auxiliary Tasks Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Janarthanan Rajendran, Richard L. Lewis, Junhyuk Oh, Hado P van Hasselt, David Silver, Satinder Singh
AAAI 2019 Learning to Communicate and Solve Visual Blocks-World Tasks Qi Zhang, Richard L. Lewis, Satinder Singh, Edmund H. Durfee
IJCAI 2016 Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games Xiaoxiao Guo, Satinder Singh, Richard L. Lewis, Honglak Lee
IJCAI 2016 The Dependence of Effective Planning Horizon on Model Accuracy Nan Jiang, Alex Kulesza, Satinder Singh, Richard L. Lewis
NeurIPS 2015 Action-Conditional Video Prediction Using Deep Networks in Atari Games Junhyuk Oh, Xiaoxiao Guo, Honglak Lee, Richard L. Lewis, Satinder Singh
NeurIPS 2014 Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning Xiaoxiao Guo, Satinder Singh, Honglak Lee, Richard L. Lewis, Xiaoshi Wang
NeurIPS 2013 Reward Mapping for Transfer in Long-Lived Agents Xiaoxiao Guo, Satinder Singh, Richard L. Lewis
AAAI 2011 Optimal Rewards Versus Leaf-Evaluation Heuristics in Planning Agents Jonathan Sorg, Satinder Singh, Richard L. Lewis
ICML 2010 Internal Rewards Mitigate Agent Boundedness Jonathan Sorg, Satinder Singh, Richard L. Lewis
NeurIPS 2010 Reward Design via Online Gradient Ascent Jonathan Sorg, Richard L. Lewis, Satinder P. Singh
UAI 2010 Variance-Based Rewards for Approximate Bayesian Reinforcement Learning Jonathan Sorg, Satinder Singh, Richard L. Lewis