Sun, Yihao
11 publications
ICLR
2026
ADM-V2: Pursuing Full-Horizon Roll-Out in Dynamics Models for Offline Policy Learning and Evaluation
ICLR
2025
Any-Step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning
ICLR
2024
Flow to Better: Offline Preference-Based Reinforcement Learning via Preferred Trajectory Generation