Brenner, Michael

12 publications

ICLR 2025 CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning Hao Cui, Zahra Shamsi, Gowoon Cheon, Xuejian Ma, Shutong Li, Maria Tikhanovskaya, Peter Christian Norgaard, Nayantara Mudur, Martyna Beata Plomecka, Paul Raccuglia, Yasaman Bahri, Victor V. Albert, Pranesh Srinivasan, Haining Pan, Philippe Faist, Brian A Rohr, Michael J. Statt, Dan Morris, Drew Purves, Elise Kleeman, Ruth Alcantara, Matthew Abraham, Muqthar Mohammad, Ean Phing VanLee, Chenfei Jiang, Elizabeth Dorfman, Eun-Ah Kim, Michael Brenner, Sameera S Ponda, Subhashini Venugopalan
NeurIPS 2025 HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate Class James V Roggeveen, Erik Y. Wang, David Ettel, Will Flintoft, Peter Donets, Raglan Ward, Ahmed Roman, Anton Marius Graf, Siddharth Dandavate, Ava Williamson, Felix Yeung, Kacper K Migacz, Yijun Wang, Egemen Bostan, Duy Thuc Nguyen, Zhe He, Marc L. Descoteaux, Anne Mykland, Shida Liu, Jorge GarcĂ­a Ponce, Luke Zhu, Yuyang Chen, Ekaterina S. Ivshina, Miguel Fernandez, Minjae Kim, Kennan Gumbs, Matthew Scott Tan, Russell Yang, Mai Hoang, David Brown, Isabella A Silveira, Lavon Sykes, Arjun Nageswaran, William Fredenberg, Yiming Chen, Lucas Martin, Yixing Tang, Kelly Werker Smith, Hongyu Liao, Logan G. Wilson, Alexander Dazhen Cai, Lucy S. Nathwani, Nickholas Gutierrez, Andrea Elizabeth Biju, Michael Brenner
ICLR 2025 HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics Jingxuan Fan, Sarah Martinson, Erik Y. Wang, Kaylie Hausknecht, Jonah Brenner, Danxian Liu, Nianli Peng, Corey Wang, Michael Brenner
NeurIPSW 2024 FEABench: Evaluating Language Models on Real World Physics Reasoning Ability Nayantara Mudur, Hao Cui, Subhashini Venugopalan, Paul Raccuglia, Michael Brenner, Peter Christian Norgaard
NeurIPSW 2024 FEABench: Evaluating Language Models on Real World Physics Reasoning Ability Nayantara Mudur, Hao Cui, Subhashini Venugopalan, Paul Raccuglia, Michael Brenner, Peter Christian Norgaard
NeurIPSW 2024 HARDMATH: A Benchmark Dataset for Challenging Problems in Applied Mathematics Jingxuan Fan, Sarah Martinson, Erik Y. Wang, Kaylie Hausknecht, Jonah Brenner, Danxian Liu, Nianli Peng, Corey Wang, Michael Brenner
TMLR 2023 Learning to Correct Spectral Methods for Simulating Turbulent Flows Gideon Dresdner, Dmitrii Kochkov, Peter Christian Norgaard, Leonardo Zepeda-Nunez, Jamie Smith, Michael Brenner, Stephan Hoyer
ICML 2021 Variational Data Assimilation with a Learned Inverse Observation Operator Thomas Frerix, Dmitrii Kochkov, Jamie Smith, Daniel Cremers, Michael Brenner, Stephan Hoyer
AAAI 2010 Creating Dynamic Story Plots with Continual Multiagent Planning Michael Brenner
IJCAI 2007 Mediating Between Qualitative and Quantitative Representations for Task-Orientated Human-Robot Interaction Michael Brenner, Nick Hawes, John D. Kelleher, Jeremy L. Wyatt
AAAI 2007 Towards an Integrated Robot with Multiple Cognitive Functions Nick Hawes, Aaron Sloman, Jeremy L. Wyatt, Michael Zillich, Henrik Jacobsson, Geert-Jan M. Kruijff, Michael Brenner, Gregor Berginc, Danijel Skocaj
IJCAI 2003 Multiagent Planning with Partially Ordered Temporal Plans Michael Brenner