Fox, Roy

24 publications

L4DC 2025 Realizable Continuous-Space Shields for Safe Reinforcement Learning Kyungmin Kim, Davide Corsi, Andoni Rodríguez, Jb Lanier, Benjami Parellada, Pierre Baldi, César Sánchez, Roy Fox
ICML 2024 Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills Kolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Sameer Singh, Peter Clark, Roy Fox
ICLR 2024 Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games Stephen Marcus McAleer, Jb Lanier, Kevin A. Wang, Pierre Baldi, Tuomas Sandholm, Roy Fox
ICML 2023 Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making Using Language Guided World Modelling Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, Roy Fox
ICLRW 2023 Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making Using Language Guided World Modelling Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, Roy Fox
ICML 2023 Learning to Design Analog Circuits to Meet Threshold Specifications Dmitrii Krylov, Pooya Khajeh, Junhan Ouyang, Thomas Reeves, Tongkai Liu, Hiba Ajmal, Hamidreza Aghasi, Roy Fox
NeurIPSW 2023 Selective Perception: Learning Concise State Descriptions for Language Model Actors Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, Jb Lanier, Pierre Baldi, Roy Fox, Sameer Singh
AISTATS 2022 Independent Natural Policy Gradient Always Converges in Markov Potential Games Roy Fox, Stephen M. McAleer, Will Overman, Ioannis Panageas
NeurIPSW 2022 Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments John Banister Lanier, Stephen Marcus McAleer, Pierre Baldi, Roy Fox
ICML 2022 Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox
NeurIPSW 2021 Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning Dailin Hu, Pieter Abbeel, Roy Fox
NeurIPSW 2021 Target Entropy Annealing for Discrete Soft Actor-Critic Yaosheng Xu, Dailin Hu, Litian Liang, Stephen Marcus McAleer, Pieter Abbeel, Roy Fox
NeurIPSW 2021 Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates Litian Liang, Yaosheng Xu, Stephen Marcus McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox
NeurIPS 2021 XDO: A Double Oracle Algorithm for Extensive-Form Games Stephen McAleer, Jb Lanier, Kevin A. Wang, Pierre Baldi, Roy Fox
NeurIPS 2020 Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games Stephen McAleer, Jb Lanier, Roy Fox, Pierre Baldi
ICMLW 2019 Multi-Task Learning via Task Multi-Clustering Andy Yan, Xin Wang, Ion Stoica, Joseph Gonzalez, Roy Fox
ICLR 2018 Parametrized Hierarchical Procedures for Neural Programming Roy Fox, Richard Shin, Sanjay Krishnan, Ken Goldberg, Dawn Song, Ion Stoica
ICML 2018 RLlib: Abstractions for Distributed Reinforcement Learning Eric Liang, Richard Liaw, Robert Nishihara, Philipp Moritz, Roy Fox, Ken Goldberg, Joseph Gonzalez, Michael Jordan, Ion Stoica
CoRL 2017 DART: Noise Injection for Robust Imitation Learning Michael Laskey, Jonathan Lee, Roy Fox, Anca D. Dragan, Ken Goldberg
CoRL 2017 DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations Sanjay Krishnan, Roy Fox, Ion Stoica, Ken Goldberg
UAI 2016 Taming the Noise in Reinforcement Learning via Soft Updates Roy Fox, Ari Pakman, Naftali Tishby
NeurIPS 2013 A Multi-Agent Control Framework for Co-Adaptation in Brain-Computer Interfaces Josh S Merel, Roy Fox, Tony Jebara, Liam Paninski
ICML 2012 Bounded Planning in Passive POMDPs Roy Fox, Naftali Tishby
AAAI 2007 A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs Roy Fox, Moshe Tennenholtz