Xiao, Chenjun

28 publications

JAIR 2025 | An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models | Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip H. S. Torr
ICML 2025 | Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning | Chen-Xiao Gao, Chenyang Wu, Mingjun Cao, Chenjun Xiao, Yang Yu, Zongzhang Zhang
ICLRW 2025 | Large Language Model-Enhanced Multi-Armed Bandits | Jiahang Sun, Zhiyong Wang, Runhan Yang, Chenjun Xiao, John C.S. Lui, Zhongxiang Dai
ICMLW 2024 | An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models | Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip Torr
NeurIPS 2024 | Diffusion Spectral Representation for Reinforcement Learning | Dmitry Shribak, Chen-Xiao Gao, Yitong Li, Chenjun Xiao, Bo Dai
NeurIPS 2024 | Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration | Hongming Zhang, Chenjun Xiao, Chao Gao, Han Wang, Bo Xu, Martin Müller
ICML 2024 | HarmonyDream: Task Harmonization Inside World Models | Haoyu Ma, Jialong Wu, Ningya Feng, Chenjun Xiao, Dong Li, Jianye Hao, Jianmin Wang, Mingsheng Long
NeurIPS 2024 | Iteratively Refined Behavior Regularization for Offline Reinforcement Learning | Yi Ma, Jianye Hao, Xiaohan Hu, Yan Zheng, Chenjun Xiao
AAAI 2024 | Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces | Xiaotian Hao, Jianye Hao, Chenjun Xiao, Kai Li, Dong Li, Yan Zheng
ICML 2024 | Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning | Hongming Zhang, Tongzheng Ren, Chenjun Xiao, Dale Schuurmans, Bo Dai
ICML 2024 | Rethinking Decision Transformer via Hierarchical Reinforcement Learning | Yi Ma, Jianye Hao, Hebin Liang, Chenjun Xiao
ICML 2024 | Target Networks and Over-Parameterization Stabilize Off-Policy Bootstrapping with Function Approximation | Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai, Ramki Gummadi, Oscar A Ramirez, Christopher K Harris, A. Rupam Mahmood, Dale Schuurmans
UAI 2023 | Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning | Xutong Zhao, Yangchen Pan, Chenjun Xiao, Sarath Chandar, Janarthanan Rajendran
UAI 2023 | Energy-Based Predictive Representations for Partially Observed Reinforcement Learning | Tianjun Zhang, Tongzheng Ren, Chenjun Xiao, Wenli Xiao, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai
NeurIPSW 2023 | Iteratively Refined Behavior Regularization for Offline Reinforcement Learning | Xiaohan Hu, Yi Ma, Chenjun Xiao, Yan Zheng, Jianye Hao
ICLR 2023 | Latent Variable Representation for Reinforcement Learning | Tongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai
ICLR 2023 | Replay Memory as an Empirical MDP: Combining Conservative Estimation with Experience Replay | Hongming Zhang, Chenjun Xiao, Han Wang, Jun Jin, Bo Xu, Martin Müller
ICLR 2023 | The In-Sample SoftMax for Offline Reinforcement Learning | Chenjun Xiao, Han Wang, Yangchen Pan, Adam White, Martha White
AISTATS 2022 | The Curse of Passive Data Collection in Batch Reinforcement Learning | Chenjun Xiao, Ilbin Lee, Bo Dai, Dale Schuurmans, Csaba Szepesvari
ICLR 2022 | Understanding and Leveraging Overparameterization in Recursive Value Estimation | Chenjun Xiao, Bo Dai, Jincheng Mei, Oscar A Ramirez, Ramki Gummadi, Chris Harris, Dale Schuurmans
ICML 2021 | On the Optimality of Batch Policy Optimization Algorithms | Chenjun Xiao, Yifan Wu, Jincheng Mei, Bo Dai, Tor Lattimore, Lihong Li, Csaba Szepesvari, Dale Schuurmans
NeurIPS 2021 | Understanding the Effect of Stochasticity in Policy Optimization | Jincheng Mei, Bo Dai, Chenjun Xiao, Csaba Szepesvari, Dale Schuurmans
NeurIPS 2020 | Escaping the Gravitational Pull of SoftMax | Jincheng Mei, Chenjun Xiao, Bo Dai, Lihong Li, Csaba Szepesvari, Dale Schuurmans
ICML 2020 | On the Global Convergence Rates of SoftMax Policy Gradient Methods | Jincheng Mei, Chenjun Xiao, Csaba Szepesvari, Dale Schuurmans
NeurIPS 2019 | Maximum Entropy Monte-Carlo Planning | Chenjun Xiao, Ruitong Huang, Jincheng Mei, Dale Schuurmans, Martin Müller
IJCAI 2019 | On Principled Entropy Exploration in Policy Optimization | Jincheng Mei, Chenjun Xiao, Ruitong Huang, Dale Schuurmans, Martin Müller
AAAI 2018 | Memory-Augmented Monte Carlo Tree Search | Chenjun Xiao, Jincheng Mei, Martin Müller
AAAI 2016 | Factorization Ranking Model for Move Prediction in the Game of Go | Chenjun Xiao, Martin Müller