Brown-Cohen, Jonah

7 publications

NeurIPS 2024 On Scalable Oversight with Weak LLMs Judging Strong LLMs Zachary Kenton, Noah Y. Siegel, János Kramár, Jonah Brown-Cohen, Samuel Albanie, Jannis Bulian, Rishabh Agarwal, David Lindner, Yunhao Tang, Noah D. Goodman, Rohin Shah
ICLR 2024 SKILL-MIX: A Flexible and Expandable Family of Evaluations for AI Models Dingli Yu, Simran Kaur, Arushi Gupta, Jonah Brown-Cohen, Anirudh Goyal, Sanjeev Arora
ICML 2024 Scalable AI Safety via Doubly-Efficient Debate Jonah Brown-Cohen, Geoffrey Irving, Georgios Piliouras
ICMLW 2024 Scalable AI Safety via Doubly-Efficient Debate Jonah Brown-Cohen, Geoffrey Irving, Georgios Piliouras
ICML 2023 Detecting Adversarial Directions in Deep Reinforcement Learning to Make Robust Decisions Ezgi Korkmaz, Jonah Brown-Cohen
NeurIPSW 2023 Skill-Mix: A Flexible and Expandable Family of Evaluations for AI Models Dingli Yu, Simran Kaur, Arushi Gupta, Jonah Brown-Cohen, Anirudh Goyal, Sanjeev Arora
NeurIPS 2021 Faster Algorithms and Constant Lower Bounds for the Worst-Case Expected Error Jonah Brown-Cohen