ML Anthology
Authors
Search
About
Douhongjian
1 publications
ICLR
2026
DIVA-GRPO: Enhancing Multimodal Reasoning Through Difficulty-Adaptive Variant Advantage
Haowen Gao
,
Zhenyu Zhang
,
Liang Pang
,
Fangda Guo
,
Douhongjian
,
Guannan Lv
,
ShaoGuo Liu
,
Tingting Gao
,
Huawei Shen
,
Xueqi Cheng