Zheng, Xiaosen

8 publications

ICLR 2025. Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates. Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Jing Jiang, Min Lin
ICLR 2025. RegMix: Data Mixture as Regression for Language Model Pre-Training. Qian Liu, Xiaosen Zheng, Niklas Muennighoff, Guangtao Zeng, Longxu Dou, Tianyu Pang, Jing Jiang, Min Lin
ICML 2024. Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast. Xiangming Gu, Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Ye Wang, Jing Jiang, Min Lin
ICLRW 2024. Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast. Xiangming Gu, Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Ye Wang, Jing Jiang, Min Lin
NeurIPSW 2024. Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates. Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Jing Jiang, Min Lin
NeurIPS 2024. Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses. Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Jing Jiang, Min Lin
ICMLW 2024. Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses. Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Jing Jiang, Min Lin
ICLR 2024. Intriguing Properties of Data Attribution on Diffusion Models. Xiaosen Zheng, Tianyu Pang, Chao Du, Jing Jiang, Min Lin