Weinstein-Raun, Benjamin

3 publications

TMLR 2026 Incomplete Tasks Induce Shutdown Resistance in Some Frontier LLMs Jeremy Schlatter, Benjamin Weinstein-Raun, Jeffrey Ladish
JAIR 2025 Thousands of AI Authors on the Future of AI Katja Grace, Julia Fabienne Sandkühler, Harlan Stewart, Benjamin Weinstein-Raun, Stephen Thomas, Zach Stein-Perlman, John Salvatier, Jan Brauner, Richard C. Korzekwa
NeurIPS 2022 Adversarial Training for High-Stakes Reliability Daniel Ziegler, Seraphina Nix, Lawrence Chan, Tim Bauman, Peter Schmidt-Nielsen, Tao Lin, Adam Scherlis, Noa Nabeshima, Benjamin Weinstein-Raun, Daniel de Haas, Buck Shlegeris, Nate Thomas