Duzan, Agatha

1 publications

NeurIPS 2025 OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents Thomas Kuntz, Agatha Duzan, Hao Zhao, Francesco Croce, J Zico Kolter, Nicolas Flammarion, Maksym Andriushchenko