Wen, Muning

11 publications

AAAI 2025 Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation Yiwei Shi, Muning Wen, Qi Zhang, Weinan Zhang, Cunjia Liu, Weiru Liu
NeurIPS 2025 MobileUse: A Hierarchical Reflection-Driven GUI Agent for Autonomous Mobile Operation Ning Li, Xiangmou Qu, Jiamu Zhou, Jun Wang, Muning Wen, Kounianhua Du, Xingyu Lou, Qiuying Peng, Jun Wang, Weinan Zhang
ICLR 2025 Robust Function-Calling for On-Device Language Model via Function Masking Qiqiang Lin, Muning Wen, Qiuying Peng, Guanyu Nie, Junwei Liao, Jun Wang, Xiaoyun Mo, Jiamu Zhou, Cheng Cheng, Yin Zhao, Jun Wang, Weinan Zhang
ICLR 2025 Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning Shangding Gu, Laixi Shi, Muning Wen, Ming Jin, Eric Mazumdar, Yuejie Chi, Adam Wierman, Costas Spanos
ICML 2024 AlphaZero-like Tree-Search Can Guide Large Language Model Decoding and Training Ziyu Wan, Xidong Feng, Muning Wen, Stephen Marcus Mcaleer, Ying Wen, Weinan Zhang, Jun Wang
NeurIPS 2024 Reinforcing LLM Agents via Policy Optimization with Action Decomposition Muning Wen, Ziyu Wan, Jun Wang, Weinan Zhang, Ying Wen
NeurIPSW 2023 AlphaZero-like Tree-Search Can Guide Large Language Model Decoding and Training Xidong Feng, Ziyu Wan, Muning Wen, Ying Wen, Weinan Zhang, Jun Wang
MLOSS 2023 MALib: A Parallel Framework for Population-Based Multi-Agent Reinforcement Learning Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Yong Yu, Jun Wang, Weinan Zhang
NeurIPS 2022 Multi-Agent Reinforcement Learning Is a Sequence Modeling Problem Muning Wen, Jakub Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang
ICLR 2022 Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang
NeurIPS 2021 Settling the Variance of Multi-Agent Policy Gradients Jakub Grudzien Kuba, Muning Wen, Linghui Meng, Shangding Gu, Haifeng Zhang, David Mguni, Jun Wang, Yaodong Yang