Zhou, Runlong
12 publications
NeurIPSW
2023
Free from Bellman Completeness: Trajectory Stitching via Model-Based Return-Conditioned Supervised Learning
ICML
2023
Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes
TMLR
2023
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization