Yang, Zhihe

4 publications

NeurIPS 2025 ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning Zeyuan Liu, Zhihe Yang, Jiawei Xu, Rui Yang, Jiafei Lyu, Baoxiang Wang, Yunjian Xu, Xiu Li
CVPR 2025 Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key Zhihe Yang, Xufang Luo, Dongqi Han, Yunjian Xu, Dongsheng Li
ICML 2025 Q-Supervised Contrastive Representation: A State Decoupling Framework for Safe Offline Reinforcement Learning Zhihe Yang, Yunjian Xu, Yang Zhang
ICLR 2024 DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning Against State Observation Perturbations Zhihe Yang, Yunjian Xu