ML Anthology
Wu, Xiaorui
1 publication
NeurIPS 2025
EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions
Xiaorui Wu, Fei Li, Xiaofeng Mao, Xin Zhang, Li Zheng, Yuxiang Peng, Chong Teng, Donghong Ji, Zhuang Li