Wang, Kaile

4 publications

NeurIPS 2025 InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback Boyuan Chen, Donghai Hong, Jiaming Ji, Jiacheng Zheng, Bowen Dong, Jiayi Zhou, Kaile Wang, Josef Dai, Xuyao Wang, Wenqi Chen, Qirui Zheng, Wenxin Li, Sirui Han, Yike Guo, Yaodong Yang
NeurIPS 2025 Safe RLHF-V: Safe Reinforcement Learning from Multi-Modal Human Feedback Jiaming Ji, Xinyu Chen, Rui Pan, Han Zhu, Jiahao Li, Donghai Hong, Boyuan Chen, Jiayi Zhou, Kaile Wang, Juntao Dai, Chi-Min Chan, Sirui Han, Yike Guo, Yaodong Yang
AAAI 2025 Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction Hantao Lou, Jiaming Ji, Kaile Wang, Yaodong Yang
NeurIPSW 2024 Language Models Resist Alignment Jiaming Ji, Kaile Wang, Tianyi Qiu, Boyuan Chen, Changye Li, Hantao Lou, Jiayi Zhou, Josef Dai, Yaodong Yang