Lanier, Jb

5 publications

L4DC 2025 Realizable Continuous-Space Shields for Safe Reinforcement Learning Kyungmin Kim, Davide Corsi, Andoni Rodrı́guez, Jb Lanier, Benjami Parellada, Pierre Baldi, César Sánchez, Roy Fox
ICLR 2024 Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games Stephen Marcus McAleer, Jb Lanier, Kevin A. Wang, Pierre Baldi, Tuomas Sandholm, Roy Fox
NeurIPSW 2023 Selective Perception: Learning Concise State Descriptions for Language Model Actors Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, Jb Lanier, Pierre Baldi, Roy Fox, Sameer Singh
NeurIPS 2021 XDO: A Double Oracle Algorithm for Extensive-Form Games Stephen McAleer, Jb Lanier, Kevin A Wang, Pierre Baldi, Roy Fox
NeurIPS 2020 Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games Stephen Mcaleer, Jb Lanier, Roy Fox, Pierre Baldi