Lewis, Richard L

18 publications

NeurIPS 2023 Combining Behaviors with the Successor Features Keyboard Wilka Carvalho Carvalho, Andre Saraiva, Angelos Filos, Andrew Lampinen, Loic Matthey, Richard L Lewis, Honglak Lee, Satinder P. Singh, Danilo Jimenez Rezende, Daniel Zoran

NeurIPS 2023 Large Language Models Can Implement Policy Iteration Ethan Brooks, Logan Walls, Richard L Lewis, Satinder P. Singh

AAAI 2022 Adaptive Pairwise Weights for Temporal Credit Assignment Zeyu Zheng, Risto Vuorio, Richard L. Lewis, Satinder Singh

NeurIPS 2021 Learning State Representations from Random Deep Action-Conditional Predictions Zeyu Zheng, Vivek Veeriah, Risto Vuorio, Richard L Lewis, Satinder P. Singh

IJCAI 2021 Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-Person Simulated 3D Environment Wilka Carvalho, Anthony Liang, Kimin Lee, Sungryull Sohn, Honglak Lee, Richard L. Lewis, Satinder Singh

ICML 2021 Reinforcement Learning of Implicit and Explicit Control Flow Instructions Ethan Brooks, Janarthanan Rajendran, Richard L Lewis, Satinder Singh

AAAI 2020 How Should an Agent Practice? Janarthanan Rajendran, Richard L. Lewis, Vivek Veeriah, Honglak Lee, Satinder Singh

NeurIPS 2019 Discovery of Useful Questions as Auxiliary Tasks Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Janarthanan Rajendran, Richard L. Lewis, Junhyuk Oh, Hado P van Hasselt, David Silver, Satinder Singh

AAAI 2019 Learning to Communicate and Solve Visual Blocks-World Tasks Qi Zhang, Richard L. Lewis, Satinder Singh, Edmund H. Durfee

IJCAI 2016 Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games Xiaoxiao Guo, Satinder Singh, Richard L. Lewis, Honglak Lee

IJCAI 2016 The Dependence of Effective Planning Horizon on Model Accuracy Nan Jiang, Alex Kulesza, Satinder Singh, Richard L. Lewis

NeurIPS 2015 Action-Conditional Video Prediction Using Deep Networks in Atari Games Junhyuk Oh, Xiaoxiao Guo, Honglak Lee, Richard L. Lewis, Satinder Singh

NeurIPS 2014 Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning Xiaoxiao Guo, Satinder Singh, Honglak Lee, Richard L. Lewis, Xiaoshi Wang

NeurIPS 2013 Reward Mapping for Transfer in Long-Lived Agents Xiaoxiao Guo, Satinder Singh, Richard L. Lewis

AAAI 2011 Optimal Rewards Versus Leaf-Evaluation Heuristics in Planning Agents Jonathan Sorg, Satinder Singh, Richard L. Lewis

ICML 2010 Internal Rewards Mitigate Agent Boundedness Jonathan Sorg, Satinder Singh, Richard L. Lewis

NeurIPS 2010 Reward Design via Online Gradient Ascent Jonathan Sorg, Richard L. Lewis, Satinder P. Singh

UAI 2010 Variance-Based Rewards for Approximate Bayesian Reinforcement Learning Jonathan Sorg, Satinder Singh, Richard L. Lewis