Kleiman-Weiner, Max

17 publications

ICLRW 2025 AI Systematically Rewires the Flow of Ideas Zhonghao He, Tianyi Qiu, Tao Lin, Moshe Glickman, Atoosa Kasirzadeh, John Wihbey, Max Kleiman-Weiner

ICML 2025 Cross-Environment Cooperation Enables Zero-Shot Multi-Agent Coordination Kunal Jha, Wilka Carvalho, Yancheng Liang, Simon Shaolei Du, Max Kleiman-Weiner, Natasha Jaques

NeurIPS 2025 Evaluating LLMs in Open-Source Games Swadesh Sistla, Max Kleiman-Weiner

ICLR 2025 Language Model Alignment in Multilingual Trolley Problems Zhijing Jin, Max Kleiman-Weiner, Giorgio Piatti, Sydney Levine, Jiarui Liu, Fernando Gonzalez Adauto, Francesco Ortu, András Strausz, Mrinmaya Sachan, Rada Mihalcea, Yejin Choi, Bernhard Schölkopf

ICML 2025 SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior Jing-Jing Li, Valentina Pyatkin, Max Kleiman-Weiner, Liwei Jiang, Nouha Dziri, Anne Collins, Jana Schaich Borg, Maarten Sap, Yejin Choi, Sydney Levine

ICML 2025 The Lock-in Hypothesis: Stagnation by Algorithm Tianyi Qiu, Zhonghao He, Tejasveer Chugh, Max Kleiman-Weiner

ICLRW 2025 The Lock-in Hypothesis: Stagnation by Algorithm Tianyi Qiu, Zhonghao He, Tejasveer Chugh, Max Kleiman-Weiner

NeurIPS 2024 Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents Giorgio Piatti, Zhijing Jin, Max Kleiman-Weiner, Bernhard Schölkopf, Mrinmaya Sachan, Rada Mihalcea

NeurIPSW 2024 InfiniteKitchen: Cross-Environment Cooperation for Zero-Shot Multi-Agent Coordination Kunal Jha, Natasha Jaques, Max Kleiman-Weiner

NeurIPSW 2024 Multilingual Trolley Problems for Language Models Zhijing Jin, Max Kleiman-Weiner, Giorgio Piatti, Sydney Levine, Jiarui Liu, Fernando Gonzalez Adauto, Francesco Ortu, András Strausz, Mrinmaya Sachan, Rada Mihalcea, Yejin Choi, Bernhard Schölkopf

NeurIPSW 2024 SafetyAnalyst: Interpretable, Transparent, and Steerable LLM Safety Moderation Jing-Jing Li, Valentina Pyatkin, Max Kleiman-Weiner, Liwei Jiang, Nouha Dziri, Anne Collins, Jana Schaich Borg, Maarten Sap, Yejin Choi, Sydney Levine

NeurIPS 2023 CLadder: Assessing Causal Reasoning in Language Models Zhijing Jin, Yuen Chen, Felix Leeb, Luigi Gresele, Ojasv Kamal, Zhiheng Lyu, Kevin Blin, Fernando Gonzalez Adauto, Max Kleiman-Weiner, Mrinmaya Sachan, Bernhard Schölkopf

ICML 2023 Learning Intuitive Policies Using Action Features Mingwei Ma, Jizhou Liu, Samuel Sokota, Max Kleiman-Weiner, Jakob Nicolaus Foerster

NeurIPS 2019 Finding Friend and Foe in Multi-Agent Games Jack Serrino, Max Kleiman-Weiner, David C. Parkes, Josh Tenenbaum

AAAI 2019 Theory of Minds: Understanding Behavior in Groups Through Inverse Planning Michael Shum, Max Kleiman-Weiner, Michael L. Littman, Joshua B. Tenenbaum

NeurIPS 2018 Learning to Share and Hide Intentions Using Information Regularization Dj Strouse, Max Kleiman-Weiner, Josh Tenenbaum, Matt Botvinick, David J Schwab

AAAI 2018 Towards Formal Definitions of Blameworthiness, Intention, and Moral Responsibility Joseph Y. Halpern, Max Kleiman-Weiner