Fang, Zhengwei

4 publications

ICML 2025 | STAIR: Improving Safety Alignment with Introspective Reasoning | Yichi Zhang, Siyuan Zhang, Yao Huang, Zeyu Xia, Zhengwei Fang, Xiao Yang, Ranjie Duan, Dong Yan, Yinpeng Dong, Jun Zhu
NeurIPS 2024 | MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models | Yichi Zhang, Yao Huang, Yitong Sun, Chang Liu, Zhe Zhao, Zhengwei Fang, Yifan Wang, Huanran Chen, Xiao Yang, Xingxing Wei, Hang Su, Yinpeng Dong, Jun Zhu
CVPR 2024 | Strong Transferable Adversarial Attacks via Ensembled Asymptotically Normal Distribution Learning | Zhengwei Fang, Rui Wang, Tao Huang, Liping Jing
NeurIPSW 2023 | How Robust Is Google's Bard to Adversarial Image Attacks? | Yinpeng Dong, Huanran Chen, Jiawei Chen, Zhengwei Fang, Xiao Yang, Yichi Zhang, Yu Tian, Hang Su, Jun Zhu