ML Anthology
Authors
Search
About
Lewis, Richard L
18 publications
NeurIPS
2023
Combining Behaviors with the Successor Features Keyboard
Wilka Carvalho Carvalho
,
Andre Saraiva
,
Angelos Filos
,
Andrew Lampinen
,
Loic Matthey
,
Richard L Lewis
,
Honglak Lee
,
Satinder P. Singh
,
Danilo Jimenez Rezende
,
Daniel Zoran
NeurIPS
2023
Large Language Models Can Implement Policy Iteration
Ethan Brooks
,
Logan Walls
,
Richard L Lewis
,
Satinder P. Singh
AAAI
2022
Adaptive Pairwise Weights for Temporal Credit Assignment
Zeyu Zheng
,
Risto Vuorio
,
Richard L. Lewis
,
Satinder Singh
NeurIPS
2021
Learning State Representations from Random Deep Action-Conditional Predictions
Zeyu Zheng
,
Vivek Veeriah
,
Risto Vuorio
,
Richard L Lewis
,
Satinder P. Singh
IJCAI
2021
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-Person Simulated 3D Environment
Wilka Carvalho
,
Anthony Liang
,
Kimin Lee
,
Sungryull Sohn
,
Honglak Lee
,
Richard L. Lewis
,
Satinder Singh
ICML
2021
Reinforcement Learning of Implicit and Explicit Control Flow Instructions
Ethan Brooks
,
Janarthanan Rajendran
,
Richard L Lewis
,
Satinder Singh
AAAI
2020
How Should an Agent Practice?
Janarthanan Rajendran
,
Richard L. Lewis
,
Vivek Veeriah
,
Honglak Lee
,
Satinder Singh
NeurIPS
2019
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah
,
Matteo Hessel
,
Zhongwen Xu
,
Janarthanan Rajendran
,
Richard L. Lewis
,
Junhyuk Oh
,
Hado P van Hasselt
,
David Silver
,
Satinder Singh
AAAI
2019
Learning to Communicate and Solve Visual Blocks-World Tasks
Qi Zhang
,
Richard L. Lewis
,
Satinder Singh
,
Edmund H. Durfee
IJCAI
2016
Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
Xiaoxiao Guo
,
Satinder Singh
,
Richard L. Lewis
,
Honglak Lee
IJCAI
2016
The Dependence of Effective Planning Horizon on Model Accuracy
Nan Jiang
,
Alex Kulesza
,
Satinder Singh
,
Richard L. Lewis
NeurIPS
2015
Action-Conditional Video Prediction Using Deep Networks in Atari Games
Junhyuk Oh
,
Xiaoxiao Guo
,
Honglak Lee
,
Richard L. Lewis
,
Satinder Singh
NeurIPS
2014
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning
Xiaoxiao Guo
,
Satinder Singh
,
Honglak Lee
,
Richard L. Lewis
,
Xiaoshi Wang
NeurIPS
2013
Reward Mapping for Transfer in Long-Lived Agents
Xiaoxiao Guo
,
Satinder Singh
,
Richard L. Lewis
AAAI
2011
Optimal Rewards Versus Leaf-Evaluation Heuristics in Planning Agents
Jonathan Sorg
,
Satinder Singh
,
Richard L. Lewis
ICML
2010
Internal Rewards Mitigate Agent Boundedness
Jonathan Sorg
,
Satinder Singh
,
Richard L. Lewis
NeurIPS
2010
Reward Design via Online Gradient Ascent
Jonathan Sorg
,
Richard L. Lewis
,
Satinder P. Singh
UAI
2010
Variance-Based Rewards for Approximate Bayesian Reinforcement Learning
Jonathan Sorg
,
Satinder Singh
,
Richard L. Lewis