Xie, Zhihui

8 publications

ICLR 2025 Jailbreaking as a Reward Misspecification Problem Zhihui Xie, Jiahui Gao, Lei Li, Zhenguo Li, Qi Liu, Lingpeng Kong
ICML 2025 Teaching Language Models to Critique via Reinforcement Learning Zhihui Xie, Jie Chen, Liyu Chen, Weichao Mao, Jingjing Xu, Lingpeng Kong
ICLRW 2025 Teaching Language Models to Critique via Reinforcement Learning Zhihui Xie, Jie Chen, Liyu Chen, Weichao Mao, Jingjing Xu, Lingpeng Kong
CVPR 2025 VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models Lei Li, Yuancheng Wei, Zhihui Xie, Xuqing Yang, Yifan Song, Peiyi Wang, Chenxin An, Tianyu Liu, Sujian Li, Bill Yuchen Lin, Lingpeng Kong, Qi Liu
NeurIPS 2024 Calibrating Reasoning in Language Models with Internal Consistency Zhihui Xie, Jizhou Guo, Tong Yu, Shuai Li
NeurIPS 2024 Learning Versatile Skills with Curriculum Masking Yao Tang, Zhihui Xie, Zichuan Lin, Deheng Ye, Shuai Li
ICML 2023 Future-Conditioned Unsupervised Pretraining for Decision Transformer Zhihui Xie, Zichuan Lin, Deheng Ye, Qiang Fu, Yang Wei, Shuai Li
ECCV 2020 Layered Neighborhood Expansion for Incremental Multiple Graph Matching Zixuan Chen, Zhihui Xie, Junchi Yan Yinqiang Zheng, Xiaokang Yang