Lv, Guannan

1 publication

ICLR 2026. DIVA-GRPO: Enhancing Multimodal Reasoning Through Difficulty-Adaptive Variant Advantage. Haowen Gao, Zhenyu Zhang, Liang Pang, Fangda Guo, Douhongjian, Guannan Lv, ShaoGuo Liu, Tingting Gao, Huawei Shen, Xueqi Cheng