Dong, Kuicai

3 publications

NeurIPS 2025 Benchmarking Retrieval-Augmented Multimodal Generation for Document Question Answering Kuicai Dong, Yujing Chang, Shijie Huang, Yasheng Wang, Ruiming Tang, Yong Liu
NeurIPS 2025 Process vs. Outcome Reward: Which Is Better for Agentic RAG Reinforcement Learning Wenlin Zhang, Xiangyang Li, Kuicai Dong, Yichao Wang, Pengyue Jia, Xiaopeng Li, Yingyi Zhang, Derong Xu, Zhaocheng Du, Huifeng Guo, Ruiming Tang, Xiangyu Zhao
ICMLW 2024 Aligning Crowd Feedback via Distributional Preference Reward Modeling Dexun Li, Cong Zhang, Kuicai Dong, Derrick Goh Xin Deik, Ruiming Tang, Yong Liu