Chan, Alan

11 publications

TMLR 2025 Infrastructure for AI Agents Alan Chan, Kevin Wei, Sihao Huang, Nitarshan Rajkumar, Elija Perrier, Seth Lazar, Gillian K Hadfield, Markus Anderljung
TMLR 2025 Open Problems in Technical AI Governance Anka Reuel, Benjamin Bucknall, Stephen Casper, Timothy Fist, Lisa Soder, Onni Aarne, Lewis Hammond, Lujain Ibrahim, Alan Chan, Peter Wills, Markus Anderljung, Ben Garfinkel, Lennart Heim, Andrew Trask, Gabriel Mukobi, Rylan Schaeffer, Mauricio Baker, Sara Hooker, Irene Solaiman, Sasha Luccioni, Nitarshan Rajkumar, Nicolas Moës, Jeffrey Ladish, David Bau, Paul Bricman, Neel Guha, Jessica Newman, Yoshua Bengio, Tobin South, Alex Pentland, Sanmi Koyejo, Mykel Kochenderfer, Robert Trager
ICML 2025 Position: AI Agents Need Authenticated Delegation Tobin South, Samuele Marro, Thomas Hardjono, Robert Mahari, Cedric Deslandes Whitney, Alan Chan, Alex Pentland
TMLR 2024 Foundational Challenges in Assuring Alignment and Safety of Large Language Models Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric J Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Chenyu Zhang, Ruiqi Zhong, Sean O hEigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Aleksandar Petrov, Christian Schroeder de Witt, Sumeet Ramesh Motwani, Yoshua Bengio, Danqi Chen, Philip Torr, Samuel Albanie, Tegan Maharaj, Jakob Nicolaus Foerster, Florian Tramèr, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger
NeurIPSW 2024 IDs for AI Systems Alan Chan, Noam Kolt, Peter Wills, Usman Anwar, Christian Schroeder de Witt, Nitarshan Rajkumar, Lewis Hammond, David Krueger, Lennart Heim, Markus Anderljung
NeurIPSW 2023 An International Consortium for AI Risk Evaluations Ross Gruetzemacher, Alan Chan, Štěpán Los, Kevin Frazier, Siméon Campos, Matija Franklin, James Fox, Jose Hernandez-Orallo, Christin Manning, Philip Tomei, Kyle Kilian
NeurIPSW 2023 Hazards from Increasingly Accessible Fine-Tuning of Downloadable Foundation Models Alan Chan, Benjamin Bucknall, Herbie Bradley, David Krueger
NeurIPSW 2023 Welfare Diplomacy: Benchmarking Language Model Cooperation Gabriel Mukobi, Hannah Erlebach, Niklas Lauffer, Lewis Hammond, Alan Chan, Jesse Clifton
JMLR 2022 Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White
AAAI 2020 Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning Kristopher De Asis, Alan Chan, Silviu Pitis, Richard S. Sutton, Daniel Graves
ICLR 2020 Training Recurrent Neural Networks Online by Learning Explicit State Variables Somjit Nath, Vincent Liu, Alan Chan, Xin Li, Adam White, Martha White