Bradley, Herbie

11 publications

ICML 2025 Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda, Gabriel Mukobi, Varun Madan, Adam Ibrahim, Herbie Bradley, Stella Biderman, Sanmi Koyejo
ICLR 2024 Quality-Diversity Through AI Feedback Herbie Bradley, Andrew Dai, Hannah Benita Teufel, Jenny Zhang, Koen Oostermeijer, Marco Bellagente, Jeff Clune, Kenneth Stanley, Gregory Schott, Joel Lehman
ICMLW 2024 Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda, Gabriel Mukobi, Varun Madan, Adam Ibrahim, Herbie Bradley, Stella Biderman, Sanmi Koyejo
ICMLW 2024 Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda, Gabriel Mukobi, Varun Madan, Adam Ibrahim, Herbie Bradley, Stella Biderman, Sanmi Koyejo
NeurIPSW 2023 Detecting Backdoors with Meta-Models Lauro Langosco, Neel Alex, William Baker, David Quarel, Herbie Bradley, David Krueger
ICMLW 2023 Do LLMs Selectively Encode the Goal of an Agent's Reach? Laura Ruis, Arduin Findeis, Herbie Bradley, Hossein A. Rahmani, Kyoung Whan Choe, Edward Grefenstette, Tim Rocktäschel
NeurIPSW 2023 Hazards from Increasingly Accessible Fine-Tuning of Downloadable Foundation Models Alan Chan, Benjamin Bucknall, Herbie Bradley, David Krueger
NeurIPS 2023 Neural MMO 2.0: A Massively Multi-Task Addition to Massively Multi-Agent Learning Joseph Suarez, David Bloomin, Kyoung Whan Choe, Hao Xiang Li, Ryan Sullivan, Nishaanth Kanna, Daniel Scott, Rose Shuman, Herbie Bradley, Louis Castricato, Phillip Isola, Chenghui Yu, Yuhao Jiang, Qimai Li, Jiaxin Chen, Xiaolong Zhu
ICML 2023 Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Stella Biderman, Hailey Schoelkopf, Quentin Gregory Anthony, Herbie Bradley, Kyle O’Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal
NeurIPSW 2023 Quality-Diversity Through AI Feedback Herbie Bradley, Andrew Dai, Hannah Benita Teufel, Jenny Zhang, Koen Oostermeijer, Marco Bellagente, Jeff Clune, Kenneth Stanley, Gregory Schott, Joel Lehman
NeurIPSW 2022 EleutherAI: Going Beyond "Open Science" to "Science in the Open" Jason Phang, Herbie Bradley, Leo Gao, Louis J. Castricato, Stella Biderman