Wagner, David

13 publications

NeurIPSW 2024 A Closer Look at System Message Robustness Norman Mu, Jonathan Lu, Michael Lavery, David Wagner
ICMLW 2024 Certifiably Robust RAG Against Retrieval Corruption Chong Xiang, Tong Wu, Zexuan Zhong, David Wagner, Danqi Chen, Prateek Mittal
ICLR 2024 PubDef: Defending Against Transfer Attacks from Public Models Chawin Sitawarin, Jaewon Chang, David Huang, Wesson Altoyan, David Wagner
NeurIPSW 2024 Stronger Universal and Transfer Attacks by Suppressing Refusals David Huang, Avidan Shah, Alexandre Araujo, David Wagner, Chawin Sitawarin
NeurIPS 2024 Toxicity Detection for Free Zhanhao Hu, Julien Piet, Geng Zhao, Jiantao Jiao, David Wagner
ICLR 2023 Part-Based Models Improve Adversarial Robustness Chawin Sitawarin, Kornrapat Pongmala, Yizheng Chen, Nicholas Carlini, David Wagner
ICCV 2023 REAP: A Large-Scale Realistic Adversarial Patch Benchmark Nabeel Hingun, Chawin Sitawarin, Jerry Li, David Wagner
ICML 2022 Demystifying the Adversarial Robustness of Random Transformation Defenses Chawin Sitawarin, Zachary J Golan-Strieb, David Wagner
NeurIPSW 2022 Part-Based Models Improve Adversarial Robustness Chawin Sitawarin, Kornrapat Pongmala, Yizheng Chen, Nicholas Carlini, David Wagner
NeurIPSW 2022 REAP: A Large-Scale Realistic Adversarial Patch Benchmark Nabeel Hingun, Chawin Sitawarin, Jerry Li, David Wagner
ECCV 2022 SLIP: Self-Supervision Meets Language-Image Pre-Training Norman Mu, Alexander Kirillov, David Wagner, Saining Xie
NeurIPS 2021 Adversarial Examples for K-Nearest Neighbor Classifiers Based on Higher-Order Voronoi Diagrams Chawin Sitawarin, Evgenios Kornaropoulos, Dawn Song, David Wagner
ICML 2018 Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples Anish Athalye, Nicholas Carlini, David Wagner