Sun, Mingfei

10 publications

TMLR 2025 Regularized Gradient Clipping Provably Trains Wide and Deep Neural Networks Matteo Tucat, Anirbit Mukherjee, Mingfei Sun, Procheta Sen, Omar Rivasplata
CoLLAs 2023 Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation Massimiliano Patacchiola, Mingfei Sun, Katja Hofmann, Richard E. Turner
ICLR 2023 Imitating Human Behaviour with Diffusion Models Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin
NeurIPS 2023 SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning Benjamin Ellis, Jonathan Cook, Skander Moalla, Mikayel Samvelyan, Mingfei Sun, Anuj Mahajan, Jakob Foerster, Shimon Whiteson
AAAI 2022 Deterministic and Discriminative Imitation (d2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency Mingfei Sun, Sam Devlin, Katja Hofmann, Shimon Whiteson
NeurIPSW 2022 Imitating Human Behaviour with Diffusion Models Tim Pearce, Tabish Rashid, Anssi Kanervisto, David Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin
ICLRW 2022 Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers Micah Carroll, Jessy Lin, Orr Paradise, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin
NeurIPS 2022 Uni[MASK]: Unified Inference in Sequential Decision Problems Micah Carroll, Orr Paradise, Jessy Lin, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin
AAAI 2020 Mastering Complex Control in MOBA Games with Deep Reinforcement Learning Deheng Ye, Zhao Liu, Mingfei Sun, Bei Shi, Peilin Zhao, Hao Wu, Hongsheng Yu, Shaojie Yang, Xipeng Wu, Qingwei Guo, Qiaobo Chen, Yinyuting Yin, Hao Zhang, Tengfei Shi, Liang Wang, Qiang Fu, Wei Yang, Lanxiao Huang
IJCAI 2019 Adversarial Imitation Learning from Incomplete Demonstrations Mingfei Sun, Xiaojuan Ma