ML Anthology
Authors
Search
About
Wang, Ruipeng
2 publications
ICLR
2026
DualEdit: Mitigating Safety Fallback in LLM Backdoor Editing via Affirmation-Refusal Regulation
Houcheng Jiang
,
Zetong Zhao
,
Junfeng Fang
,
Haokai Ma
,
Ruipeng Wang
,
Xiang Wang
,
Xiangnan He
,
Yang Deng
NeurIPS
2024
Towards Neuron Attributions in Multi-Modal Large Language Models
Junfeng Fang
,
Zongze Bi
,
Ruipeng Wang
,
Houcheng Jiang
,
Yuan Gao
,
Kun Wang
,
An Zhang
,
Jie Shi
,
Xiang Wang
,
Tat-Seng Chua