Cen, Shicong

10 publications

TMLR 2026 Beyond Expectations: Learning with Stochastic Dominance Made Practical Shicong Cen, Jincheng Mei, Hanjun Dai, Dale Schuurmans, Yuejie Chi, Bo Dai
AISTATS 2025 Faster WIND: Accelerating Iterative Best-of-$n$ Distillation for LLM Alignment Tong Yang, Jincheng Mei, Hanjun Dai, Zixin Wen, Shicong Cen, Dale Schuurmans, Yuejie Chi, Bo Dai
ICLR 2025 Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF Shicong Cen, Jincheng Mei, Katayoon Goshvadi, Hanjun Dai, Tong Yang, Sherry Yang, Dale Schuurmans, Yuejie Chi, Bo Dai
JMLR 2024 Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization Shicong Cen, Yuting Wei, Yuejie Chi
NeurIPS 2024 Federated Natural Policy Gradient and Actor Critic Methods for Multi-Task Reinforcement Learning Tong Yang, Shicong Cen, Yuting Wei, Yuxin Chen, Yuejie Chi
ICLR 2023 Asynchronous Gradient Play in Zero-Sum Multi-Agent Games Ruicheng Ao, Shicong Cen, Yuejie Chi
ICLR 2023 Faster Last-Iterate Convergence of Policy Optimization in Zero-Sum Markov Games Shicong Cen, Yuejie Chi, Simon Shaolei Du, Lin Xiao
NeurIPS 2021 Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization Shicong Cen, Yuting Wei, Yuejie Chi
AISTATS 2020 Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction Boyue Li, Shicong Cen, Yuxin Chen, Yuejie Chi
JMLR 2020 Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction Boyue Li, Shicong Cen, Yuxin Chen, Yuejie Chi