Stroebl, Benedikt

5 publications

TMLR 2025 AI Agents That Matter Sayash Kapoor, Benedikt Stroebl, Zachary S Siegel, Nitya Nadgir, Arvind Narayanan
NeurIPS 2025 Dynamic Risk Assessments for Offensive Cybersecurity Agents Boyi Wei, Benedikt Stroebl, Jiacen Xu, Joie Zhang, Zhou Li, Peter Henderson
UAI 2025 Hindsight Merging: Diverse Data Generation with Language Models Veniamin Veselovsky, Benedikt Stroebl, Gianluca Bencomo, Dilip Arumugam, Lisa Schut, Arvind Narayanan, Thomas L. Griffiths
NeurIPS 2025 Information Retrieval Induced Safety Degradation in AI Agents Cheng Yu, Benedikt Stroebl, Diyi Yang, Orestis Papakyriakopoulos
TMLR 2024 CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark Zachary S Siegel, Sayash Kapoor, Nitya Nadgir, Benedikt Stroebl, Arvind Narayanan