Brown-Cohen, Jonah

7 publications

NeurIPS 2024. On Scalable Oversight with Weak LLMs Judging Strong LLMs. Zachary Kenton, Noah Y. Siegel, János Kramár, Jonah Brown-Cohen, Samuel Albanie, Jannis Bulian, Rishabh Agarwal, David Lindner, Yunhao Tang, Noah D. Goodman, Rohin Shah

ICLR 2024. SKILL-MIX: A Flexible and Expandable Family of Evaluations for AI Models. Dingli Yu, Simran Kaur, Arushi Gupta, Jonah Brown-Cohen, Anirudh Goyal, Sanjeev Arora

ICML 2024. Scalable AI Safety via Doubly-Efficient Debate. Jonah Brown-Cohen, Geoffrey Irving, Georgios Piliouras

ICMLW 2024. Scalable AI Safety via Doubly-Efficient Debate. Jonah Brown-Cohen, Geoffrey Irving, Georgios Piliouras

ICML 2023. Detecting Adversarial Directions in Deep Reinforcement Learning to Make Robust Decisions. Ezgi Korkmaz, Jonah Brown-Cohen

NeurIPSW 2023. Skill-Mix: A Flexible and Expandable Family of Evaluations for AI Models. Dingli Yu, Simran Kaur, Arushi Gupta, Jonah Brown-Cohen, Anirudh Goyal, Sanjeev Arora

NeurIPS 2021. Faster Algorithms and Constant Lower Bounds for the Worst-Case Expected Error. Jonah Brown-Cohen