Pelrine, Kellin
18 publications
TMLR
2026
Open Technical Problems in Open-Weight AI Model Risk Management
Stephen Casper, Kyle O'Brien, Shayne Longpre, Elizabeth Seger, Kevin Klyman, Rishi Bommasani, Aniruddha Nrusimha, Ilia Shumailov, Sören Mindermann, Steven Basart, Frank Rudzicz, Kellin Pelrine, Avijit Ghosh, Andrew Strait, Robert Kirk, Dan Hendrycks, Peter Henderson, J Zico Kolter, Geoffrey Irving, Yarin Gal, Yoshua Bengio, Dylan Hadfield-Menell
ICLRW
2025
From Intuition to Understanding: Using AI Peers to Overcome Physics Misconceptions
Ruben Weijers, Denton Wu, Hannah Betts, Tamara Jacod, Yuxiang Guan, Vidya Sujaya, Kushal Dev, William Delooze, Reihaneh Rabbany, Ying Wu, Jean-François Godbout, Kellin Pelrine
ICLRW
2025
From Intuition to Understanding: Using AI Peers to Overcome Physics Misconceptions
Ruben Weijers, Denton Wu, Hannah Betts, Tamara Jacod, Yuxiang Guan, Vidya Sujaya, Kushal Dev, Toshali Goel, William Delooze, Reihaneh Rabbany, Ying Wu, Jean-François Godbout, Kellin Pelrine
IJCAI
2025
SandboxSocial: A Sandbox for Social Media Using Multimodal AI Agents
Maximilian Puelma Touzel, Sneheel Sarangi, Gayatri Krishnakumar, Busra Tugce Gurbuz, Austin Welch, Zachary Yang, Andreea Musulan, Hao Yu, Ethan Kosak-Hine, Tom Gibbs, Camille Thibault, Reihaneh Rabbany, Jean-François Godbout, Dan Zhao, Kellin Pelrine
IJCAI
2025
Veracity: An Open-Source AI Fact-Checking System
Taylor Lynn Curtis, Maximilian Puelma Touzel, William Garneau, Manon Gruaz, Mike Pinder, Li Wei Wang, Sukanya Krishna, Luda Cohen, Jean-François Godbout, Reihaneh Rabbany, Kellin Pelrine
NeurIPSW
2024
Simulation System Towards Solving Societal-Scale Manipulation
Maximilian Puelma Touzel, Sneheel Sarangi, Austin Welch, Gayatri K, Dan Zhao, Zachary Yang, Hao Yu, Tom Gibbs, Ethan Kosak-Hine, Andreea Musulan, Camille Thibault, Busra Tugce Gurbuz, Reihaneh Rabbany, Jean-François Godbout, Kellin Pelrine
NeurIPSW
2024
Simulation System Towards Solving Societal-Scale Manipulation
Maximilian Puelma Touzel, Sneheel Sarangi, Austin Welch, Gayatri K, Dan Zhao, Zachary Yang, Hao Yu, Tom Gibbs, Ethan Kosak-Hine, Andreea Musulan, Camille Thibault, Reihaneh Rabbany, Jean-François Godbout, Kellin Pelrine
ICML
2023
Adversarial Policies Beat Superhuman Go AIs
Tony Tong Wang, Adam Gleave, Tom Tseng, Kellin Pelrine, Nora Belrose, Joseph Miller, Michael D Dennis, Yawen Duan, Viktor Pogrebniak, Sergey Levine, Stuart Russell