Schwettmann, Sarah

8 publications

ICML 2025 Eliciting Language Model Behaviors with Investigator Agents Xiang Lisa Li, Neil Chowdhury, Daniel D. Johnson, Tatsunori Hashimoto, Percy Liang, Sarah Schwettmann, Jacob Steinhardt
NeurIPS 2025 Establishing Best Practices in Building Rigorous Agentic Benchmarks Yuxuan Zhu, Tengjun Jin, Yada Pruksachatkun, Andy K Zhang, Shu Liu, Sasha Cui, Sayash Kapoor, Shayne Longpre, Kevin Meng, Rebecca Weiss, Fazl Barez, Rahul Gupta, Jwala Dhamala, Jacob Merizian, Mario Giulianelli, Harry Coppock, Cozmin Ududec, Antony Kellermann, Jasjeet S Sekhon, Jacob Steinhardt, Sarah Schwettmann, Arvind Narayanan, Matei Zaharia, Ion Stoica, Percy Liang, Daniel Kang
ICML 2024 A Multimodal Automated Interpretability Agent Tamar Rott Shaham, Sarah Schwettmann, Franklin Wang, Achyuta Rajaram, Evan Hernandez, Jacob Andreas, Antonio Torralba
NeurIPSW 2023 An Alternative to Regulation: The Case for Public AI Nicholas Vincent, David Bau, Sarah Schwettmann, Joshua Tan
NeurIPS 2023 FIND: A Function Description Benchmark for Evaluating Interpretability Methods Sarah Schwettmann, Tamar Shaham, Joanna Materzynska, Neil Chowdhury, Shuang Li, Jacob Andreas, David Bau, Antonio Torralba
ICCVW 2023 Multimodal Neurons in Pretrained Text-Only Transformers Sarah Schwettmann, Neil Chowdhury, Samuel Klein, David Bau, Antonio Torralba
ICLR 2022 Natural Language Descriptions of Deep Visual Features Evan Hernandez, Sarah Schwettmann, David Bau, Teona Bagashvili, Antonio Torralba, Jacob Andreas
ICCV 2021 Toward a Visual Concept Vocabulary for GAN Latent Space Sarah Schwettmann, Evan Hernandez, David Bau, Samuel Klein, Jacob Andreas, Antonio Torralba