Jansen, Peter

5 publications

ICLR 2026 AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite Jonathan Bragg, Mike D'Arcy, Nishant Balepur, Dan Bareket, Bhavana Dalvi Mishra, Sergey Feldman, Dany Haddad, Jena D. Hwang, Peter Jansen, Varsha Kishore, Bodhisattwa Prasad Majumder, Aakanksha Naik, Sigal Rahamimov, Kyle Richardson, Amanpreet Singh, Harshit Surana, Aryeh Tiktinsky, Rosni Vasu, Guy Wiener, Chloe Anastasiades, Stefanus Candra, Jason Dunkelberger, Daniel Emery, Rob Evans, Malachi Hamada, Regan Huff, Rodney Kinney, Matt Latzke, Jaron Lochner, Ruben Lozano-Aguilera, Ngoc-Uyen Nguyen, Smita Rao, Amber Tanaka, Brooke Vlahos, Peter Clark, Doug Downey, Yoav Goldberg, Ashish Sabharwal, Daniel S Weld
ICLR 2025 From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question-Answering Nathaniel Weir, Bhavana Dalvi Mishra, Orion Weller, Oyvind Tafjord, Sam Hornstein, Alexander Sabol, Peter Jansen, Benjamin Van Durme, Peter Clark
NeurIPS 2024 DiscoveryWorld: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents Peter Jansen, Marc-Alexandre Côté, Tushar Khot, Erin Bransom, Bhavana Dalvi Mishra, Bodhisattwa Prasad Majumder, Oyvind Tafjord, Peter Clark
NeurIPSW 2023 CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Peter Jansen, Oyvind Tafjord, Niket Tandon, Li Zhang, Chris Callison-Burch, Peter Clark
AAAI 2020 QASC: A Dataset for Question Answering via Sentence Composition Tushar Khot, Peter Clark, Michal Guerquin, Peter Jansen, Ashish Sabharwal