Chen, Liyu

22 publications

AAAI 2025. Effective Diffusion Transformer Architecture for Image Super-Resolution. Kun Cheng, Lei Yu, Zhijun Tu, Xiao He, Liyu Chen, Yong Guo, Mingrui Zhu, Nannan Wang, Xinbo Gao, Jie Hu
ICML 2025. Reward-Augmented Data Enhances Direct Preference Alignment of LLMs. Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang
ICLRW 2025. Reward-Augmented Data Enhances Direct Preference Alignment of LLMs. Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang
ICML 2025. Teaching Language Models to Critique via Reinforcement Learning. Zhihui Xie, Jie Chen, Liyu Chen, Weichao Mao, Jingjing Xu, Lingpeng Kong
ICLRW 2025. Teaching Language Models to Critique via Reinforcement Learning. Zhihui Xie, Jie Chen, Liyu Chen, Weichao Mao, Jingjing Xu, Lingpeng Kong
ICLR 2024. $\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis. Zishun Yu, Yunzhe Tao, Liyu Chen, Tao Sun, Hongxia Yang
NeurIPSW 2023. $\mathcal{B}$-Coder: On Value-Based Deep Reinforcement Learning for Program Synthesis. Zishun Yu, Yunzhe Tao, Liyu Chen, Tao Sun, Hongxia Yang
ICML 2023. Layered State Discovery for Incremental Autonomous Exploration. Liyu Chen, Andrea Tirinzoni, Alessandro Lazaric, Matteo Pirotta
UAI 2023. Posterior Sampling-Based Online Learning for the Stochastic Shortest Path Model. Mehdi Jafarnia-Jahromi, Liyu Chen, Rahul Jain, Haipeng Luo
ALT 2023. Reaching Goals Is Hard: Settling the Sample Complexity of the Stochastic Shortest Path. Liyu Chen, Andrea Tirinzoni, Matteo Pirotta, Alessandro Lazaric
AISTATS 2022. Policy Learning and Evaluation with Randomized Quasi-Monte Carlo. Sébastien M. R. Arnold, Pierre L’Ecuyer, Liyu Chen, Yi-Fan Chen, Fei Sha
NeurIPS 2022. Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback. Yan Dai, Haipeng Luo, Liyu Chen
ICML 2022. Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP. Liyu Chen, Rahul Jain, Haipeng Luo
ICML 2022. Learning Infinite-Horizon Average-Reward Markov Decision Process with Constraints. Liyu Chen, Rahul Jain, Haipeng Luo
NeurIPS 2022. Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments. Liyu Chen, Haipeng Luo
COLT 2022. Policy Optimization for Stochastic Shortest Path. Liyu Chen, Haipeng Luo, Aviv Rosenberg
ICML 2021. Finding the Stochastic Shortest Path with Low Regret: The Adversarial Cost and Unknown Transition Case. Liyu Chen, Haipeng Luo
NeurIPS 2021. Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path. Liyu Chen, Mehdi Jafarnia-Jahromi, Rahul Jain, Haipeng Luo
COLT 2021. Impossible Tuning Made Possible: A New Expert Algorithm and Its Applications. Liyu Chen, Haipeng Luo, Chen-Yu Wei
COLT 2021. Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition. Liyu Chen, Haipeng Luo, Chen-Yu Wei
IJCAI 2019. Hyper-Parameter Tuning Under a Budget Constraint. Zhiyun Lu, Liyu Chen, Chao-Kai Chiang, Fei Sha
NeurIPS 2018. Synthesized Policies for Transfer and Adaptation Across Tasks and Environments. Hexiang Hu, Liyu Chen, Boqing Gong, Fei Sha