Zhang, Zhilong
14 publications
ICLR
2025
Any-Step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning
ICLR
2024
Flow to Better: Offline Preference-Based Reinforcement Learning via Preferred Trajectory Generation
14 publications