Zhang, Wancong

2 publications

NeurIPS 2025. Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models. Vlad Sobal, Wancong Zhang, Kyunghyun Cho, Randall Balestriero, Tim G. J. Rudner, Yann LeCun.
ICLRW 2025. Stress-Testing Offline Reward-Free Reinforcement Learning: A Case for Planning with Latent Dynamics Models. Vlad Sobal, Wancong Zhang, Kyunghyun Cho, Randall Balestriero, Tim G. J. Rudner, Yann LeCun.