Chen, Jiankang
5 publications
ICLR
2026
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
YiFan Zhang, Xingyu Lu, Xiao Hu, Chaoyou Fu, Bin Wen, Tianke Zhang, Changyi Liu, Kaiyu Jiang, Kaibing Chen, Kaiyu Tang, Haojie Ding, Jiankang Chen, Fan Yang, Zhang Zhang, Tingting Gao, Di Zhang, Guorui Zhou, Liang Wang
ICLR
2026
Thyme: Think Beyond Images
YiFan Zhang, Xingyu Lu, Shukang Yin, Chaoyou Fu, Wei Chen, Xiao Hu, Bin Wen, Kaiyu Jiang, Changyi Liu, Tianke Zhang, Haonan Fan, Kaibing Chen, Jiankang Chen, Haojie Ding, Kaiyu Tang, Zhang Zhang, Liang Wang, Fan Yang, Tingting Gao, Guorui Zhou
ICLR
2025
TaskGalaxy: Scaling Multi-Modal Instruction Fine-Tuning with Tens of Thousands Vision Task Types
Jiankang Chen, Tianke Zhang, Changyi Liu, Haojie Ding, Yaya Shi, Cheng Feng, Huihui Xiao, Bin Wen, Fan Yang, Tingting Gao, Di Zhang