ML Anthology
Fu, Shaopeng
6 publications
ICLRW
2025
"Short-Length" Adversarial Training Helps LLMs Defend "Long-Length" Jailbreak Attacks: Theoretical and Empirical Evidence
Shaopeng Fu, Liang Ding, Di Wang
NeurIPS
2025
Short-Length Adversarial Training Helps LLMs Defend Long-Length Jailbreak Attacks: Theoretical and Empirical Evidence
Shaopeng Fu, Liang Ding, Jingfeng Zhang, Di Wang
ICLRW
2025
Understanding Private Learning from Feature Perspective
Meng Ding, Mingxi Lei, Shaopeng Fu, Di Wang, Jinhui Xu
ICLR
2024
Theoretical Analysis of Robust Overfitting for Wide DNNs: An NTK Approach
Shaopeng Fu, Di Wang
ICLR
2022
Knowledge Removal in Sampling-Based Bayesian Inference
Shaopeng Fu, Fengxiang He, Dacheng Tao
ICLR
2022
Robust Unlearnable Examples: Protecting Data Privacy Against Adversarial Learning
Shaopeng Fu, Fengxiang He, Yang Liu, Li Shen, Dacheng Tao