Stroebl, Benedikt

5 publications

TMLR 2025 AI Agents That Matter Sayash Kapoor, Benedikt Stroebl, Zachary S Siegel, Nitya Nadgir, Arvind Narayanan

NeurIPS 2025 Dynamic Risk Assessments for Offensive Cybersecurity Agents Boyi Wei, Benedikt Stroebl, Jiacen Xu, Joie Zhang, Zhou Li, Peter Henderson

UAI 2025 Hindsight Merging: Diverse Data Generation with Language Models Veniamin Veselovsky, Benedikt Stroebl, Gianluca Bencomo, Dilip Arumugam, Lisa Schut, Arvind Narayanan, Thomas L. Griffiths

NeurIPS 2025 Information Retrieval Induced Safety Degradation in AI Agents Cheng Yu, Benedikt Stroebl, Diyi Yang, Orestis Papakyriakopoulos

TMLR 2024 CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark Zachary S Siegel, Sayash Kapoor, Nitya Nadgir, Benedikt Stroebl, Arvind Narayanan