ML Anthology
Li, Yuetai
2 publications
ICLRW 2025
CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models
Yuetai Li, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Dinuka Sahabandu, Bhaskar Ramasubramanian, Radha Poovendran
ICLRW 2025
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
Fengqing Jiang, Zhangchen Xu, Yuetai Li, Luyao Niu, Zhen Xiang, Bo Li, Bill Yuchen Lin, Radha Poovendran