ML Anthology
Guo, Hanxi
5 publications
ICCV
2025
JailbreakDiffBench: A Comprehensive Benchmark for Jailbreaking Diffusion Models
Xiaolong Jin, Zixuan Weng, Hanxi Guo, Chenlong Yin, Siyuan Cheng, Guangyu Shen, Xiangyu Zhang
NeurIPS
2024
BiScope: AI-Generated Text Detection by Checking Memorization of Preceding Tokens
Hanxi Guo, Siyuan Cheng, Xiaolong Jin, Zhuo Zhang, Kaiyuan Zhang, Guanhong Tao, Guangyu Shen, Xiangyu Zhang
NeurIPSW
2024
MultiVerse: Exposing Large Language Model Alignment Problems in Diverse Worlds
Xiaolong Jin, Zhuo Zhang, Guangyu Shen, Hanxi Guo, Kaiyuan Zhang, Siyuan Cheng, Xiangyu Zhang
NeurIPSW
2024
SkewAct: Red Teaming Large Language Models via Activation-Skewed Adversarial Prompt Optimization
Hanxi Guo, Siyuan Cheng, Guanhong Tao, Guangyu Shen, Zhuo Zhang, Shengwei An, Kaiyuan Zhang, Xiangyu Zhang
ECCV
2024
UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Siyuan Cheng, Guangyu Shen, Kaiyuan Zhang, Guanhong Tao, Shengwei An, Hanxi Guo, Shiqing Ma, Xiangyu Zhang