Zhao, Dongbin
19 publications
CoRL
2025
FetchBot: Learning Generalizable Object Fetching in Cluttered Scenes via Zero-Shot Sim2Real
AAAI
2025
In-Dataset Trajectory Return Regularization for Offline Preference-Based Reinforcement Learning
NeurIPS
2025
Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL
CoRL
2025
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-Loop Autonomous Driving
ICLR
2025
Unsupervised Zero-Shot Reinforcement Learning via Dual-Value Forward-Backward Representation
NeurIPS
2025
Videos Are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations
ICCV
2025
World4Drive: End-to-End Autonomous Driving via Intention-Aware Physical Latent World Model
NeurIPS
2024
Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization