Xu, Ruifeng
22 publications
ICLR
2026
GEPO: Group Expectation Policy Optimization for Stable Heterogeneous Reinforcement Learning
NeurIPS
2024
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models
AAAI
2021
Exploring Auxiliary Reasoning Tasks for Task-Oriented Dialog Systems with Meta Cooperative Learning