Evtimov, Ivan

10 publications

NeurIPS 2025 AdvPrefix: An Objective for Nuanced LLM Jailbreaks Sicheng Zhu, Brandon Amos, Yuandong Tian, Chuan Guo, Ivan Evtimov
NeurIPS 2025 AgentDAM: Privacy Leakage Evaluation for Autonomous Web Agents Arman Zharmagambetov, Chuan Guo, Ivan Evtimov, Maya Pavlova, Ruslan Salakhutdinov, Kamalika Chaudhuri
ICML 2025 Automated Red Teaming with GOAT: The Generative Offensive Agent Tester Maya Pavlova, Erik Brinkman, Krithika Iyer, Vı́tor Albiero, Joanna Bitton, Hailey Nguyen, Cristian Canton Ferrer, Ivan Evtimov, Aaron Grattafiori
ICLRW 2025 Automated Red Teaming with GOAT: The Generative Offensive Agent Tester Maya Pavlova, Erik Brinkman, Krithika Iyer, Vítor Albiero, Joanna Bitton, Hailey Nguyen, Cristian Canton Ferrer, Ivan Evtimov, Aaron Grattafiori
ICLR 2025 Persistent Pre-Training Poisoning of LLMs Yiming Zhang, Javier Rando, Ivan Evtimov, Jianfeng Chi, Eric Michael Smith, Nicholas Carlini, Florian Tramèr, Daphne Ippolito
NeurIPS 2025 WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks Ivan Evtimov, Arman Zharmagambetov, Aaron Grattafiori, Chuan Guo, Kamalika Chaudhuri
CVPR 2023 A Whac-a-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others Zhiheng Li, Ivan Evtimov, Albert Gordo, Caner Hazirbas, Tal Hassner, Cristian Canton Ferrer, Chenliang Xu, Mark Ibrahim
ICCVW 2023 Confusing Large Models by Confusing Small Models Vítor Albiero, Raghav Mehta, Ivan Evtimov, Samuel J. Bell, Levent Sagun, Aram Markosyan
ICLR 2023 ImageNet-X: Understanding Model Mistakes with Factor of Variation Annotations Badr Youbi Idrissi, Diane Bouchacourt, Randall Balestriero, Ivan Evtimov, Caner Hazirbas, Nicolas Ballas, Pascal Vincent, Michal Drozdzal, David Lopez-Paz, Mark Ibrahim
ICMLW 2021 Disrupting Model Training with Adversarial Shortcuts Ivan Evtimov, Ian Connick Covert, Aditya Kusupati, Tadayoshi Kohno