Wang, Yansheng

2 publications

ICLR 2026 Buffer Matters: Unleashing the Power of Off-Policy Reinforcement Learning in Large Language Model Reasoning Xu Wan, Yansheng Wang, Wenqi Huang, Mingyang Sun
AAAI 2020 Federated Latent Dirichlet Allocation: A Local Differential Privacy Based Framework Yansheng Wang, Yongxin Tong, Dingyuan Shi