Zhao, Li

38 publications

ICML 2025 AdaptiveStep: Automatically Dividing Reasoning Step Through Model Confidence Yuliang Liu, Junjie Lu, Chaofeng Qu, Zhaoling Chen, Zefan Cai, Jason Klein Liu, Chonghan Liu, Yunhui Xia, Li Zhao, Jiang Bian, Chuheng Zhang, Wei Shen, Zhouhan Lin
ICML 2025 DPO Meets PPO: Reinforced Token Optimization for RLHF Han Zhong, Zikang Shan, Guhao Feng, Wei Xiong, Xinle Cheng, Li Zhao, Di He, Jiang Bian, Liwei Wang
NeurIPS 2025 Dyn-O: Building Structured World Models with Object-Centric Representations Zizhao Wang, Kaixin Wang, Li Zhao, Peter Stone, Jiang Bian
AAAI 2025 One-Shot Reference-Based Structure-Aware Image to Sketch Synthesis Rui Yang, Honghong Yang, Li Zhao, Qin Lei, Mianxiong Dong, Kaoru Ota, Xiaojun Wu
ICML 2025 Policy Filtration for RLHF to Mitigate Noise in Reward Models Chuheng Zhang, Wei Shen, Li Zhao, Xuyun Zhang, Xiaolong Xu, Wanchun Dou, Jiang Bian
ICLR 2025 Video In-Context Learning: Autoregressive Transformers Are Zero-Shot Video Imitators Wentao Zhang, Junliang Guo, Tianyu He, Li Zhao, Linli Xu, Jiang Bian
NeurIPS 2025 What Do Latent Action Models Actually Learn? Chuheng Zhang, Tim Pearce, Pushi Zhang, Kaixin Wang, Xiaoyu Chen, Wei Shen, Li Zhao, Jiang Bian
ICMLW 2024 DPO Meets PPO: Reinforced Token Optimization for RLHF Han Zhong, Guhao Feng, Wei Xiong, Xinle Cheng, Li Zhao, Di He, Jiang Bian, Liwei Wang
IJCAI 2024 Diversification of Adaptive Policy for Effective Offline Reinforcement Learning Yunseon Choi, Li Zhao, Chuheng Zhang, Lei Song, Jiang Bian, Kee-Eung Kim
AAAI 2024 VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning Tangfei Liao, Xiaoqin Zhang, Li Zhao, Tao Wang, Guobao Xiao
NeurIPS 2023 Distributional Pareto-Optimal Multi-Objective Reinforcement Learning Xin-Qiang Cai, Pushi Zhang, Li Zhao, Jiang Bian, Masashi Sugiyama, Ashley Llorens
AAAI 2023 H-TSP: Hierarchically Solving the Large-Scale Traveling Salesman Problem Xuanhao Pan, Yan Jin, Yuandong Ding, Mingxiao Feng, Li Zhao, Lei Song, Jiang Bian
AAAI 2023 Pointerformer: Deep Reinforced Multi-Pointer Transformer for the Traveling Salesman Problem Yan Jin, Yuandong Ding, Xuanhao Pan, Kun He, Li Zhao, Tao Qin, Lei Song, Jiang Bian
ICML 2023 Robust Situational Reinforcement Learning in Face of Context Disturbances Jinpeng Zhang, Yufeng Zheng, Chuheng Zhang, Li Zhao, Lei Song, Yuan Zhou, Jiang Bian
IJCAI 2023 Towards Generalizable Reinforcement Learning for Trade Execution Chuheng Zhang, Yitong Duan, Xiaoyu Chen, Jianyu Chen, Jian Li, Li Zhao
NeurIPS 2022 An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context Xiaoyu Chen, Xiangming Zhu, Yufeng Zheng, Pushi Zhang, Li Zhao, Wenxue Cheng, Peng Cheng, Yongqiang Xiong, Tao Qin, Jianyu Chen, Tie-Yan Liu
NeurIPS 2022 Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret Jiawei Huang, Li Zhao, Tao Qin, Wei Chen, Nan Jiang, Tie-Yan Liu
ICLR 2022 Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality Jiawei Huang, Jinglin Chen, Li Zhao, Tao Qin, Nan Jiang, Tie-Yan Liu
NeurIPS 2021 Curriculum Offline Imitating Learning Minghuan Liu, Hanye Zhao, Zhengyu Yang, Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu
NeurIPS 2021 Distributional Reinforcement Learning for Multi-Dimensional Reward Functions Pushi Zhang, Xiaoyu Chen, Li Zhao, Wei Xiong, Tao Qin, Tie-Yan Liu
IJCAI 2021 Independence-Aware Advantage Estimation Pushi Zhang, Li Zhao, Guoqing Liu, Jiang Bian, Minlie Huang, Tao Qin, Tie-Yan Liu
NeurIPS 2021 Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning Jongjin Park, Younggyo Seo, Chang Liu, Li Zhao, Tao Qin, Jinwoo Shin, Tie-Yan Liu
ICLR 2021 Return-Based Contrastive Representation Learning for Reinforcement Learning Guoqing Liu, Chuheng Zhang, Li Zhao, Tao Qin, Jinhua Zhu, Jian Li, Nenghai Yu, Tie-Yan Liu
NeurIPS 2020 RD$^2$: Reward Decomposition with Representation Decomposition Zichuan Lin, Derek Yang, Li Zhao, Tao Qin, Guangwen Yang, Tie-Yan Liu
NeurIPS 2019 Distributional Reward Decomposition for Reinforcement Learning Zichuan Lin, Li Zhao, Derek Yang, Tao Qin, Tie-Yan Liu, Guangwen Yang
NeurIPS 2019 Fully Parameterized Quantile Function for Distributional Reinforcement Learning Derek Yang, Li Zhao, Zichuan Lin, Tao Qin, Jiang Bian, Tie-Yan Liu
AAAI 2019 Trust Region Evolution Strategies Guoqing Liu, Li Zhao, Feidiao Yang, Jiang Bian, Tao Qin, Nenghai Yu, Tie-Yan Liu
ACML 2019 Unified Policy Optimization for Robust Reinforcement Learning Zichuan Lin, Li Zhao, Jiang Bian, Tao Qin, Guangwen Yang
ACML 2018 Adversarial Neural Machine Translation Lijun Wu, Yingce Xia, Fei Tian, Li Zhao, Tao Qin, Jianhuang Lai, Tie-Yan Liu
AAAI 2018 Dual Transfer Learning for Neural Machine Translation with Marginal Distribution Regularization Yijun Wang, Yingce Xia, Li Zhao, Jiang Bian, Tao Qin, Guiquan Liu, Tie-Yan Liu
AAAI 2018 Learning Structured Representation for Text Classification via Reinforcement Learning Tianyang Zhang, Minlie Huang, Li Zhao
AAAI 2018 Reinforcement Learning for Relation Classification from Noisy Data Jun Feng, Minlie Huang, Li Zhao, Yang Yang, Xiaoyan Zhu
AAAI 2018 Word Attention for Sequence to Sequence Text Understanding Lijun Wu, Fei Tian, Li Zhao, Jianhuang Lai, Tie-Yan Liu
IJCAI 2017 Sequence Prediction with Unlabeled Data by Reward Function Learning Lijun Wu, Li Zhao, Tao Qin, Jianhuang Lai, Tie-Yan Liu
AAAI 2016 Semi-Supervised Multinomial Naive Bayes for Text Classification by Leveraging Word-Level Statistical Constraint Li Zhao, Minlie Huang, Ziyu Yao, Rongwei Su, Yingying Jiang, Xiaoyan Zhu
CVPR 2005 3D Measurements in Cargo Inspection with a Gamma-Ray Linear Pushbroom Stereo System Zhigang Zhu, Li Zhao, Jiayan Lei
CVPRW 2005 3D Measurements in Cargo Inspection with a Gamma-Ray Linear Pushbroom Stereo System Zhigang Zhu, Li Zhao, Jiayan Lei
NeCo 2004 A Modified Algorithm for Generalized Discriminant Analysis Wenming Zheng, Li Zhao, Cairong Zou