Fu, Shaopeng

6 publications

ICLRW 2025 "Short-Length" Adversarial Training Helps LLMs Defend "Long-Length" Jailbreak Attacks: Theoretical and Empirical Evidence Shaopeng Fu, Liang Ding, Di Wang
NeurIPS 2025 Short-Length Adversarial Training Helps LLMs Defend Long-Length Jailbreak Attacks: Theoretical and Empirical Evidence Shaopeng Fu, Liang Ding, Jingfeng Zhang, Di Wang
ICLRW 2025 Understanding Private Learning from Feature Perspective Meng Ding, Mingxi Lei, Shaopeng Fu, Di Wang, Jinhui Xu
ICLR 2024 Theoretical Analysis of Robust Overfitting for Wide DNNs: An NTK Approach Shaopeng Fu, Di Wang
ICLR 2022 Knowledge Removal in Sampling-Based Bayesian Inference Shaopeng Fu, Fengxiang He, Dacheng Tao
ICLR 2022 Robust Unlearnable Examples: Protecting Data Privacy Against Adversarial Learning Shaopeng Fu, Fengxiang He, Yang Liu, Li Shen, Dacheng Tao