Xu, Zhongwen

36 publications

ICLRW 2025 Pre-Trained Video Generative Models as World Simulators Haoran He, Yang Zhang, Liang Lin, Zhongwen Xu, Ling Pan
ICLR 2024 Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform Shengyi Huang, Jiayi Weng, Rujikorn Charakorn, Min Lin, Zhongwen Xu, Santiago Ontanon
IJCAI 2024 Reinforcement Learning from Diverse Human Preferences Wanqi Xue, Bo An, Shuicheng Yan, Zhongwen Xu
ICLR 2023 DaxBench: Benchmarking Deformable Object Manipulation with Differentiable Physics Siwei Chen, Yiqing Xu, Cunjun Yu, Linfeng Li, Xiao Ma, Zhongwen Xu, David Hsu
ICLR 2023 Distributional Meta-Gradient Reinforcement Learning Haiyan Yin, Shuicheng Yan, Zhongwen Xu
ICLR 2023 Efficient Offline Policy Optimization with a Learned Model Zichen Liu, Siyi Li, Wee Sun Lee, Shuicheng Yan, Zhongwen Xu
CVPR 2023 Imitation Learning as State Matching via Differentiable Physics Siwei Chen, Xiao Ma, Zhongwen Xu
NeurIPS 2023 Mutual Information Regularized Offline Reinforcement Learning Xiao Ma, Bingyi Kang, Zhongwen Xu, Min Lin, Shuicheng Yan
ICLR 2023 RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning Wei Qiu, Xiao Ma, Bo An, Svetlana Obraztsova, Shuicheng Yan, Zhongwen Xu
AAAI 2023 Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning Yang Yue, Bingyi Kang, Zhongwen Xu, Gao Huang, Shuicheng Yan
ICLR 2023 Visual Imitation Learning with Patch Rewards Minghuan Liu, Tairan He, Weinan Zhang, Shuicheng Yan, Zhongwen Xu
NeurIPSW 2022 Boosting Offline Reinforcement Learning via Data Rebalancing Yang Yue, Bingyi Kang, Xiao Ma, Zhongwen Xu, Gao Huang, Shuicheng Yan
NeurIPSW 2022 Efficient Offline Policy Optimization with a Learned Model Zichen Liu, Siyi Li, Wee Sun Lee, Shuicheng Yan, Zhongwen Xu
NeurIPS 2022 EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine Jiayi Weng, Min Lin, Shengyi Huang, Bo Liu, Denys Makoviichuk, Viktor Makoviychuk, Zichen Liu, Yufan Song, Ting Luo, Yukun Jiang, Zhongwen Xu, Shuicheng Yan
NeurIPSW 2022 Mutual Information Regularized Offline Reinforcement Learning Xiao Ma, Bingyi Kang, Zhongwen Xu, Min Lin, Shuicheng Yan
NeurIPSW 2022 Visual Imitation Learning with Patch Rewards Minghuan Liu, Tairan He, Weinan Zhang, Shuicheng Yan, Zhongwen Xu
ICLR 2021 Balancing Constraints and Rewards with Meta-Gradient D4PG Dan A. Calian, Daniel J Mankowitz, Tom Zahavy, Zhongwen Xu, Junhyuk Oh, Nir Levine, Timothy Mann
NeurIPS 2021 Discovery of Options via Meta-Learned Subgoals Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado P van Hasselt, David Silver, Satinder P. Singh
ICML 2021 Emphatic Algorithms for Deep Reinforcement Learning Ray Jiang, Tom Zahavy, Zhongwen Xu, Adam White, Matteo Hessel, Charles Blundell, Hado Van Hasselt
NeurIPS 2020 A Self-Tuning Actor-Critic Algorithm Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado P van Hasselt, David Silver, Satinder P. Singh
NeurIPS 2020 Discovering Reinforcement Learning Algorithms Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado P van Hasselt, Satinder P. Singh, David Silver
NeurIPS 2020 Meta-Gradient Reinforcement Learning with an Objective Discovered Online Zhongwen Xu, Hado P van Hasselt, Matteo Hessel, Junhyuk Oh, Satinder P. Singh, David Silver
ICML 2020 What Can Learned Intrinsic Rewards Capture? Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado Van Hasselt, David Silver, Satinder Singh
NeurIPS 2019 Discovery of Useful Questions as Auxiliary Tasks Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Janarthanan Rajendran, Richard L. Lewis, Junhyuk Oh, Hado P van Hasselt, David Silver, Satinder Singh
NeurIPS 2018 Meta-Gradient Reinforcement Learning Zhongwen Xu, Hado P van Hasselt, David Silver
IJCAI 2018 Watching a Small Portion Could Be as Good as Watching All: Towards Efficient Video Classification Hehe Fan, Zhongwen Xu, Linchao Zhu, Chenggang Yan, Jianjun Ge, Yi Yang
CVPR 2017 Bidirectional Multirate Reconstruction for Temporal Modeling in Videos Linchao Zhu, Zhongwen Xu, Yi Yang
CVPR 2017 Few-Shot Object Recognition from Machine-Labeled Web Images Zhongwen Xu, Linchao Zhu, Yi Yang
NeurIPS 2017 Natural Value Approximators: Learning When to Trust past Estimates Zhongwen Xu, Joseph Modayil, Hado P van Hasselt, Andre Barreto, David Silver, Tom Schaul
CVPR 2016 Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning Pingbo Pan, Zhongwen Xu, Yi Yang, Fei Wu, Yueting Zhuang
AAAI 2016 Robust Semi-Supervised Learning Through Label Aggregation Yan Yan, Zhongwen Xu, Ivor W. Tsang, Guodong Long, Yi Yang
CVPR 2015 A Discriminative CNN Video Representation for Event Detection Zhongwen Xu, Yi Yang, Alex G. Hauptmann
CVPR 2014 Event Detection Using Multi-Level Relevance Labels and Multiple Features Zhongwen Xu, Ivor W. Tsang, Yi Yang, Zhigang Ma, Alexander G. Hauptmann
CVPR 2013 Complex Event Detection via Multi-Source Video Attributes Zhigang Ma, Yi Yang, Zhongwen Xu, Shuicheng Yan, Nicu Sebe, Alexander G. Hauptmann
ICCV 2013 Feature Weighting via Optimal Thresholding for Video Analysis Zhongwen Xu, Yi Yang, Ivor Tsang, Nicu Sebe, Alexander G. Hauptmann
ICCV 2013 How Related Exemplars Help Complex Event Detection in Web Videos? Yi Yang, Zhigang Ma, Zhongwen Xu, Shuicheng Yan, Alexander G. Hauptmann