Ning, Shan
5 publications
ICLR
2026
Wiki-R1: Incentivizing Multimodal Reasoning for Knowledge-Based VQA via Data and Sampling Curriculum
TMLR
2025
DA-DPO: Cost-Efficient Difficulty-Aware Preference Optimization for Reducing MLLM Hallucinations