McAleer, Stephen

16 publications

AAAI 2024 Automated Design of Affine Maximizer Mechanisms in Dynamic Settings Michael J. Curry, Vinzenz Thoma, Darshan Chakrabarti, Stephen McAleer, Christian Kroer, Tuomas Sandholm, Niao He, Sven Seuken
IJCAI 2024 Policy Space Response Oracles: A Survey Ariyan Bighashdel, Yongzhao Wang, Stephen McAleer, Rahul Savani, Frans A. Oliehoek
IJCAI 2024 Scalable Mechanism Design for Multi-Agent Path Finding Paul Friedrich, Yulun Zhang, Michael J. Curry, Ludwig Dierks, Stephen McAleer, Jiaoyang Li, Tuomas Sandholm, Sven Seuken
NeurIPS 2023 Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games Brian Zhang, Gabriele Farina, Ioannis Anagnostides, Federico Cacciamani, Stephen McAleer, Andreas Haupt, Andrea Celli, Nicola Gatti, Vincent Conitzer, Tuomas Sandholm
NeurIPSW 2023 Confronting Reward Model Overoptimization with Constrained RLHF Ted Moskovitz, Aaditya Singh, Dj Strouse, Tuomas Sandholm, Ruslan Salakhutdinov, Anca Dragan, Stephen McAleer
NeurIPS 2023 Language Models Can Solve Computer Tasks Geunwoo Kim, Pierre Baldi, Stephen McAleer
NeurIPSW 2023 Llemma: An Open Language Model for Mathematics Zhangir Azerbayev, Hailey Schoelkopf, Keiran Paster, Marco Dos Santos, Stephen McAleer, Albert Jiang, Jia Deng, Stella Biderman, Sean Welleck
NeurIPS 2023 Policy Space Diversity for Non-Transitive Games Jian Yao, Weiming Liu, Haobo Fu, Yaodong Yang, Stephen McAleer, Qiang Fu, Wei Yang
NeurIPS 2023 Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning Stephen McAleer, Gabriele Farina, Gaoyue Zhou, Mingzhi Wang, Yaodong Yang, Tuomas Sandholm
ICML 2022 Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks Litian Liang, Yaosheng Xu, Stephen Mcaleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox
NeurIPS 2022 Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuan Jiang, Zongqing Lu, Stephen McAleer, Hao Dong, Song-Chun Zhu, Yaodong Yang
NeurIPS 2021 Neural Auto-Curricula in Two-Player Zero-Sum Games Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang
NeurIPS 2021 XDO: A Double Oracle Algorithm for Extensive-Form Games Stephen McAleer, Jb Lanier, Kevin A Wang, Pierre Baldi, Roy Fox
ICML 2020 Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination Somdeb Majumdar, Shauharda Khadka, Santiago Miret, Stephen Mcaleer, Kagan Tumer
NeurIPS 2020 Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games Stephen Mcaleer, Jb Lanier, Roy Fox, Pierre Baldi
ICLR 2019 Solving the Rubik's Cube with Approximate Policy Iteration Stephen McAleer, Forest Agostinelli, Alexander Shmakov, Pierre Baldi