ML Anthology
McDaniel, Patrick
5 publications
ICLR
2025
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs
Xiaogeng Liu, Peiran Li, G. Edward Suh, Yevgeniy Vorobeychik, Zhuoqing Mao, Somesh Jha, Patrick McDaniel, Huan Sun, Bo Li, Chaowei Xiao
ICLR
2025
Can Watermarks Be Used to Detect LLM IP Infringement for Free?
Zhengyue Zhao, Xiaogeng Liu, Somesh Jha, Patrick McDaniel, Bo Li, Chaowei Xiao
ICCV
2025
On the Robustness Tradeoff in Fine-Tuning
Kunyang Li, Jean-Charles Noirot Ferrand, Ryan Sheatsley, Blaine Hoak, Yohan Beugin, Eric Pauley, Patrick McDaniel
NeurIPS
2024
BackdoorAlign: Mitigating Fine-Tuning Based Jailbreak Attack with Backdoor Enhanced Safety Alignment
Jiongxiao Wang, Jiazhao Li, Yiquan Li, Xiangyu Qi, Junjie Hu, Yixuan Li, Patrick McDaniel, Muhao Chen, Bo Li, Chaowei Xiao
ICLR
2018
Ensemble Adversarial Training: Attacks and Defenses
Florian Tramèr, Alexey Kurakin, Nicolas Papernot, Ian Goodfellow, Dan Boneh, Patrick McDaniel