Brenner, Michael
12 publications
ICLR
2025
CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning
Hao Cui, Zahra Shamsi, Gowoon Cheon, Xuejian Ma, Shutong Li, Maria Tikhanovskaya, Peter Christian Norgaard, Nayantara Mudur, Martyna Beata Plomecka, Paul Raccuglia, Yasaman Bahri, Victor V. Albert, Pranesh Srinivasan, Haining Pan, Philippe Faist, Brian A Rohr, Michael J. Statt, Dan Morris, Drew Purves, Elise Kleeman, Ruth Alcantara, Matthew Abraham, Muqthar Mohammad, Ean Phing VanLee, Chenfei Jiang, Elizabeth Dorfman, Eun-Ah Kim, Michael Brenner, Sameera S Ponda, Subhashini Venugopalan NeurIPS
2025
HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate Class
James V Roggeveen, Erik Y. Wang, David Ettel, Will Flintoft, Peter Donets, Raglan Ward, Ahmed Roman, Anton Marius Graf, Siddharth Dandavate, Ava Williamson, Felix Yeung, Kacper K Migacz, Yijun Wang, Egemen Bostan, Duy Thuc Nguyen, Zhe He, Marc L. Descoteaux, Anne Mykland, Shida Liu, Jorge GarcĂa Ponce, Luke Zhu, Yuyang Chen, Ekaterina S. Ivshina, Miguel Fernandez, Minjae Kim, Kennan Gumbs, Matthew Scott Tan, Russell Yang, Mai Hoang, David Brown, Isabella A Silveira, Lavon Sykes, Arjun Nageswaran, William Fredenberg, Yiming Chen, Lucas Martin, Yixing Tang, Kelly Werker Smith, Hongyu Liao, Logan G. Wilson, Alexander Dazhen Cai, Lucy S. Nathwani, Nickholas Gutierrez, Andrea Elizabeth Biju, Michael Brenner