Yang, Mengyue

16 publications

TMLR 2026 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Guibin Zhang, Hejia Geng, Xiaohang Yu, Zhenfei Yin, Zaibin Zhang, Zelin Tan, Heng Zhou, Zhong-Zhi Li, Xiangyuan Xue, Yijiang Li, Yifan Zhou, Yang Chen, Chen Zhang, Yutao Fan, Zihu Wang, Songtao Huang, Francisco Piedrahita Velez, Yue Liao, Hongru Wang, Mengyue Yang, Heng Ji, Jun Wang, Shuicheng Yan, Philip Torr, Lei Bai
NeurIPS 2025 A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning Anjie Liu, Jianhong Wang, Samuel Kaski, Jun Wang, Mengyue Yang
ICLR 2025 Causal Representation Learning from Multimodal Biomedical Observations Yuewen Sun, Lingjing Kong, Guangyi Chen, Loka Li, Gongxu Luo, Zijian Li, Yixuan Zhang, Yujia Zheng, Mengyue Yang, Petar Stojanov, Eran Segal, Eric P. Xing, Kun Zhang
NeurIPS 2025 Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning Xiangning Yu, Zhuohan Wang, Linyi Yang, Haoxuan Li, Anjie Liu, Xiao Xue, Jun Wang, Mengyue Yang
NeurIPS 2025 Curious Causality-Seeking Agents in Open-Ended Worlds Zhiyu Zhao, Haoxuan Li, Haifeng Zhang, Jun Wang, Francesco Faccio, Jürgen Schmidhuber, Mengyue Yang
NeurIPS 2025 Decentralized Dynamic Cooperation of Personalized Models for Federated Continual Learning Danni Yang, Zhikang Chen, Sen Cui, Mengyue Yang, Ding Li, Abudukelimu Wuerkaixi, Haoxuan Li, Jinke Ren, Mingming Gong
ICLR 2025 Efficient Reinforcement Learning with Large Language Model Priors Xue Yan, Yan Song, Xidong Feng, Mengyue Yang, Haifeng Zhang, Haitham Bou Ammar, Jun Wang
ICML 2025 Large Language Models Are Demonstration Pre-Selectors for Themselves Jiarui Jin, Yuwei Wu, Haoxuan Li, Xiaoting He, Weinan Zhang, Yiming Yang, Yong Yu, Jun Wang, Mengyue Yang
NeurIPS 2025 MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework Qirui Mi, Mengyue Yang, Xiangning Yu, Zhiyu Zhao, Cheng Deng, Bo An, Haifeng Zhang, Xu Chen, Jun Wang
ICLRW 2025 Natural Language Reinforcement Learning Xidong Feng, Bo Liu, Ziyu Wan, Haotian Fu, Girish A. Koushik, Zhiyuan Hu, Mengyue Yang, Ying Wen, Jun Wang
NeurIPS 2025 Unveiling Extraneous Sampling Bias with Data Missing-Not-at-Random Chunyuan Zheng, Haocheng Yang, Haoxuan Li, Mengyue Yang
ICML 2025 When Can Proxies Improve the Sample Complexity of Preference Learning? Yuchen Zhu, Daniel Augusto De Souza, Zhengyan Shi, Mengyue Yang, Pasquale Minervini, Matt Kusner, Alexander D’Amour
NeurIPS 2023 ChessGPT: Bridging Policy Learning and Language Modeling Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang
NeurIPS 2023 Invariant Learning via Probability of Sufficient and Necessary Causes Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang
NeurIPS 2023 Lending Interaction Wings to Recommender Systems with Conversational Agents Jiarui Jin, Xianyu Chen, Fanghua Ye, Mengyue Yang, Yue Feng, Weinan Zhang, Yong Yu, Jun Wang
CVPR 2021 CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models Mengyue Yang, Furui Liu, Zhitang Chen, Xinwei Shen, Jianye Hao, Jun Wang