Brenner, Michael
13 publications
ICLR
2026
CMT-Benchmark: A Benchmark for Condensed Matter Theory Built by Expert Researchers
Haining Pan, James V Roggeveen, Erez Berg, Juan Felipe Carrasquilla Alvarez, Debanjan Chowdhury, Surya Ganguli, Federico Ghimenti, Juraj Hasik, Henry S. Hunt, Hong-Chen Jiang, Mason Kamb, Ying-Jer Kao, Ehsan Khatami, Michael J Lawler, Di Luo, Titus Neupert, Xiaoliang Qi, Michael Brenner, Eun-Ah Kim ICLR
2025
CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning
Hao Cui, Zahra Shamsi, Gowoon Cheon, Xuejian Ma, Shutong Li, Maria Tikhanovskaya, Peter Christian Norgaard, Nayantara Mudur, Martyna Beata Plomecka, Paul Raccuglia, Yasaman Bahri, Victor V. Albert, Pranesh Srinivasan, Haining Pan, Philippe Faist, Brian A Rohr, Michael J. Statt, Dan Morris, Drew Purves, Elise Kleeman, Ruth Alcantara, Matthew Abraham, Muqthar Mohammad, Ean Phing VanLee, Chenfei Jiang, Elizabeth Dorfman, Eun-Ah Kim, Michael Brenner, Sameera S Ponda, Subhashini Venugopalan NeurIPS
2025
HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate Class
James V Roggeveen, Erik Y. Wang, David Ettel, Will Flintoft, Peter Donets, Raglan Ward, Ahmed Roman, Anton Marius Graf, Siddharth Dandavate, Ava Williamson, Felix Yeung, Kacper K Migacz, Yijun Wang, Egemen Bostan, Duy Thuc Nguyen, Zhe He, Marc L. Descoteaux, Anne Mykland, Shida Liu, Jorge GarcĂa Ponce, Luke Zhu, Yuyang Chen, Ekaterina S. Ivshina, Miguel Fernandez, Minjae Kim, Kennan Gumbs, Matthew Scott Tan, Russell Yang, Mai Hoang, David Brown, Isabella A Silveira, Lavon Sykes, Arjun Nageswaran, William Fredenberg, Yiming Chen, Lucas Martin, Yixing Tang, Kelly Werker Smith, Hongyu Liao, Logan G. Wilson, Alexander Dazhen Cai, Lucy S. Nathwani, Nickholas Gutierrez, Andrea Elizabeth Biju, Michael Brenner