ML Anthology
Authors
Search
About
Duzan, Agatha
1 publications
NeurIPS
2025
OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents
Thomas Kuntz
,
Agatha Duzan
,
Hao Zhao
,
Francesco Croce
,
J Zico Kolter
,
Nicolas Flammarion
,
Maksym Andriushchenko