Wei, Chen-Yu

46 publications

NeurIPS 2025 An Improved Algorithm for Adversarial Linear Contextual Bandits via Reduction Tim van Erven, Jack Mayo, Julia Olkhovskaya, Chen-Yu Wei
COLT 2025 Decision Making in Hybrid Environments: A Model Aggregation Approach Haolin Liu, Chen-Yu Wei, Zimmert Julian
NeurIPS 2025 From Average-Iterate to Last-Iterate Convergence in Games: A Reduction and Its Applications Yang Cai, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng
NeurIPS 2024 Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit Feedback Haolin Liu, Zakaria Mhammedi, Chen-Yu Wei, Julian Zimmert
NeurIPS 2024 Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification Haolin Liu, Artin Tajdini, Andrew Wagenmaker, Chen-Yu Wei
NeurIPS 2024 How Does Variance Shape the Regret in Contextual Bandits? Zeyu Jia, Jian Qian, Alexander Rakhlin, Chen-Yu Wei
AISTATS 2024 Near-Optimal Policy Optimization for Correlated Equilibrium in General-Sum Markov Games Yang Cai, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng
COLT 2024 Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data Zeyu Jia, Alexander Rakhlin, Ayush Sekhari, Chen-Yu Wei
NeurIPS 2024 On Tractable $\Phi$-Equilibria in Non-Concave Games Yang Cai, Constantinos Daskalakis, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng
ICLR 2024 Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback Haolin Liu, Chen-Yu Wei, Julian Zimmert
COLT 2023 A Blackbox Approach to Best of Both Worlds in Bandits and Beyond Chris Dann, Chen-Yu Wei, Julian Zimmert
ALT 2023 A Unified Algorithm for Stochastic Path Problems Christoph Dann, Chen-Yu Wei, Julian Zimmert
ICML 2023 Best of Both Worlds Policy Optimization Christoph Dann, Chen-Yu Wei, Julian Zimmert
NeurIPS 2023 Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits Haolin Liu, Chen-Yu Wei, Julian Zimmert
NeurIPS 2023 First- and Second-Order Bounds for Adversarial Linear Contextual Bandits Julia Olkhovskaya, Jack Mayo, Tim van Erven, Gergely Neu, Chen-Yu Wei
NeurIPS 2023 Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs Dongsheng Ding, Chen-Yu Wei, Kaiqing Zhang, Alejandro Ribeiro
NeurIPS 2023 No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions Tiancheng Jin, Junyan Liu, Chloé Rouyer, William Chang, Chen-Yu Wei, Haipeng Luo
ICML 2023 Refined Regret for Adversarial MDPs with Linear Function Approximation Yan Dai, Haipeng Luo, Chen-Yu Wei, Julian Zimmert
ICMLW 2023 Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games Yang Cai, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng
NeurIPS 2023 Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback Yang Cai, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng
ALT 2022 A Model Selection Approach for Corruption Robust Reinforcement Learning Chen-Yu Wei, Christoph Dann, Julian Zimmert
ALT 2022 Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure Hsu Kao, Chen-Yu Wei, Vijay Subramanian
ICML 2022 Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence Dongsheng Ding, Chen-Yu Wei, Kaiqing Zhang, Mihailo Jovanovic
ICML 2022 Personalization Improves Privacy-Accuracy Tradeoffs in Federated Learning Alberto Bietti, Chen-Yu Wei, Miroslav Dudik, John Langford, Steven Wu
AISTATS 2021 Learning Infinite-Horizon Average-Reward MDPs with Linear Function Approximation Chen-Yu Wei, Mehdi Jafarnia Jahromi, Haipeng Luo, Rahul Jain
ICML 2021 Achieving near Instance-Optimality and Minimax-Optimality in Stochastic and Adversarial Linear Bandits Simultaneously Chung-Wei Lee, Haipeng Luo, Chen-Yu Wei, Mengxiao Zhang, Xiaojin Zhang
ALT 2021 Adversarial Online Learning with Changing Action Sets: Efficient Algorithms with Approximate Regret Bounds Ehsan Emamjomeh-Zadeh, Chen-Yu Wei, Haipeng Luo, David Kempe
COLT 2021 Impossible Tuning Made Possible: A New Expert Algorithm and Its Applications Liyu Chen, Haipeng Luo, Chen-Yu Wei
COLT 2021 Last-Iterate Convergence of Decentralized Optimistic Gradient Descent/Ascent in Infinite-Horizon Competitive Markov Games Chen-Yu Wei, Chung-Wei Lee, Mengxiao Zhang, Haipeng Luo
ICLR 2021 Linear Last-Iterate Convergence in Constrained Saddle-Point Optimization Chen-Yu Wei, Chung-Wei Lee, Mengxiao Zhang, Haipeng Luo
COLT 2021 Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition Liyu Chen, Haipeng Luo, Chen-Yu Wei
COLT 2021 Non-Stationary Reinforcement Learning Without Prior Knowledge: An Optimal Black-Box Approach Chen-Yu Wei, Haipeng Luo
NeurIPS 2021 Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses Haipeng Luo, Chen-Yu Wei, Chung-Wei Lee
NeurIPS 2020 Bias No More: High-Probability Data-Dependent Regret Bounds for Adversarial Bandits and MDPs Chung-Wei Lee, Haipeng Luo, Chen-Yu Wei, Mengxiao Zhang
ICML 2020 Model-Free Reinforcement Learning in Infinite-Horizon Average-Reward Markov Decision Processes Chen-Yu Wei, Mehdi Jafarnia Jahromi, Haipeng Luo, Hiteshi Sharma, Rahul Jain
COLT 2020 Taking a Hint: How to Leverage Loss Predictors in Contextual Bandits? Chen-Yu Wei, Haipeng Luo, Alekh Agarwal
COLT 2019 A New Algorithm for Non-Stationary Contextual Bandits: Efficient, Optimal and Parameter-Free Yifang Chen, Chung-Wei Lee, Haipeng Luo, Chen-Yu Wei
COLT 2019 Achieving Optimal Dynamic Regret for Non-Stationary Bandits Without Prior Information Peter Auer, Yifang Chen, Pratik Gajane, Chung-Wei Lee, Haipeng Luo, Ronald Ortner, Chen-Yu Wei
ICML 2019 Bandit Multiclass Linear Classification: Efficient Algorithms for the Separable Case Alina Beygelzimer, David Pal, Balazs Szorenyi, Devanathan Thiruvenkatachari, Chen-Yu Wei, Chicheng Zhang
ICML 2019 Beating Stochastic and Adversarial Semi-Bandits Optimally and Simultaneously Julian Zimmert, Haipeng Luo, Chen-Yu Wei
COLT 2019 Improved Path-Length Regret Bounds for Bandits Sébastien Bubeck, Yuanzhi Li, Haipeng Luo, Chen-Yu Wei
COLT 2018 Efficient Contextual Bandits in Non-Stationary Worlds Haipeng Luo, Chen-Yu Wei, Alekh Agarwal, John Langford
NeurIPS 2018 Efficient Online Portfolio with Logarithmic Regret Haipeng Luo, Chen-Yu Wei, Kai Zheng
COLT 2018 More Adaptive Algorithms for Adversarial Bandits Chen-Yu Wei, Haipeng Luo
NeurIPS 2017 Online Reinforcement Learning in Stochastic Games Chen-Yu Wei, Yi-Te Hong, Chi-Jen Lu
NeurIPS 2016 Tracking the Best Expert in Non-Stationary Stochastic Environments Chen-Yu Wei, Yi-Te Hong, Chi-Jen Lu