Gan, Yaozhong

7 publications

ICCV 2025 Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment RenYe Yan, Jikang Cheng, Yaozhong Gan, Shikun Sun, You Wu, Yunfan Yang, Liang Ling, Jinlong Lin, Yeshuang Zhu, Jie Zhou, Jinchao Zhang, Junliang Xing, Yimao Cai, Ru Huang
ICLR 2024 PAE: Reinforcement Learning from External Knowledge for Efficient Exploration Zhe Wu, Haofei Lu, Junliang Xing, You Wu, Renye Yan, Yaozhong Gan, Yuanchun Shi
ICML 2024 Reflective Policy Optimization Yaozhong Gan, Renye Yan, Zhe Wu, Junliang Xing
AAAI 2022 Robust Action Gap Increasing with Clipped Advantage Learning Zhe Zhang, Yaozhong Gan, Xiaoyang Tan
AAAI 2022 Smoothing Advantage Learning Yaozhong Gan, Zhe Zhang, Xiaoyang Tan
AAAI 2021 Stabilizing Q Learning via Soft Mellowmax Operator Yaozhong Gan, Zhe Zhang, Xiaoyang Tan
NeurIPS 2019 Trust Region-Guided Proximal Policy Optimization Yuhui Wang, Hao He, Xiaoyang Tan, Yaozhong Gan