ML Anthology
Fox, Roy
24 publications
L4DC
2025
Realizable Continuous-Space Shields for Safe Reinforcement Learning
Kyungmin Kim, Davide Corsi, Andoni Rodríguez, Jb Lanier, Benjami Parellada, Pierre Baldi, César Sánchez, Roy Fox
ICML
2024
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
Kolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Sameer Singh, Peter Clark, Roy Fox
ICLR
2024
Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games
Stephen Marcus McAleer, Jb Lanier, Kevin A. Wang, Pierre Baldi, Tuomas Sandholm, Roy Fox
ICML
2023
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making Using Language Guided World Modelling
Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, Roy Fox
ICLRW
2023
Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making Using Language Guided World Modelling
Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, Roy Fox
ICML
2023
Learning to Design Analog Circuits to Meet Threshold Specifications
Dmitrii Krylov, Pooya Khajeh, Junhan Ouyang, Thomas Reeves, Tongkai Liu, Hiba Ajmal, Hamidreza Aghasi, Roy Fox
NeurIPSW
2023
Selective Perception: Learning Concise State Descriptions for Language Model Actors
Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, Jb Lanier, Pierre Baldi, Roy Fox, Sameer Singh
AISTATS
2022
Independent Natural Policy Gradient Always Converges in Markov Potential Games
Roy Fox, Stephen M. McAleer, Will Overman, Ioannis Panageas
NeurIPSW
2022
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
John Banister Lanier, Stephen Marcus McAleer, Pierre Baldi, Roy Fox
ICML
2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox
NeurIPSW
2021
Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Dailin Hu, Pieter Abbeel, Roy Fox
NeurIPSW
2021
Target Entropy Annealing for Discrete Soft Actor-Critic
Yaosheng Xu, Dailin Hu, Litian Liang, Stephen Marcus McAleer, Pieter Abbeel, Roy Fox
NeurIPSW
2021
Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Litian Liang, Yaosheng Xu, Stephen Marcus McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox
NeurIPS
2021
XDO: A Double Oracle Algorithm for Extensive-Form Games
Stephen McAleer, Jb Lanier, Kevin A. Wang, Pierre Baldi, Roy Fox
NeurIPS
2020
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
Stephen McAleer, Jb Lanier, Roy Fox, Pierre Baldi
ICMLW
2019
Multi-Task Learning via Task Multi-Clustering
Andy Yan, Xin Wang, Ion Stoica, Joseph Gonzalez, Roy Fox
ICLR
2018
Parametrized Hierarchical Procedures for Neural Programming
Roy Fox, Richard Shin, Sanjay Krishnan, Ken Goldberg, Dawn Song, Ion Stoica
ICML
2018
RLlib: Abstractions for Distributed Reinforcement Learning
Eric Liang, Richard Liaw, Robert Nishihara, Philipp Moritz, Roy Fox, Ken Goldberg, Joseph Gonzalez, Michael Jordan, Ion Stoica
CoRL
2017
DART: Noise Injection for Robust Imitation Learning
Michael Laskey, Jonathan Lee, Roy Fox, Anca D. Dragan, Ken Goldberg
CoRL
2017
DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations
Sanjay Krishnan, Roy Fox, Ion Stoica, Ken Goldberg
UAI
2016
Taming the Noise in Reinforcement Learning via Soft Updates
Roy Fox, Ari Pakman, Naftali Tishby
NeurIPS
2013
A Multi-Agent Control Framework for Co-Adaptation in Brain-Computer Interfaces
Josh S Merel, Roy Fox, Tony Jebara, Liam Paninski
ICML
2012
Bounded Planning in Passive POMDPs
Roy Fox, Naftali Tishby
AAAI
2007
A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs
Roy Fox, Moshe Tennenholtz