Dai, Juntao
8 publications
NeurIPS 2025
Safe RLHF-V: Safe Reinforcement Learning from Multi-Modal Human Feedback
Jiaming Ji, Xinyu Chen, Rui Pan, Han Zhu, Jiahao Li, Donghai Hong, Boyuan Chen, Jiayi Zhou, Kaile Wang, Juntao Dai, Chi-Min Chan, Sirui Han, Yike Guo, Yaodong Yang