Bai, Qinbo

10 publications

NeurIPS 2025 Global Convergence for Average Reward Constrained MDPs with Primal-Dual Actor Critic Algorithm Yang Xu, Swetha Ganesh, Washim Uddin Mondal, Qinbo Bai, Vaneet Aggarwal
NeurIPS 2024 Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm Qinbo Bai, Washim Uddin Mondal, Vaneet Aggarwal
AAAI 2024 Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes Qinbo Bai, Washim Uddin Mondal, Vaneet Aggarwal
AAAI 2023 Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm Qinbo Bai, Amrit Singh Bedi, Vaneet Aggarwal
JMLR 2023 Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints Qinbo Bai, Vaneet Aggarwal, Ather Gattami
AAAI 2022 Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach Qinbo Bai, Amrit Singh Bedi, Mridul Agarwal, Alec Koppel, Vaneet Aggarwal
TMLR 2022 Concave Utility Reinforcement Learning with Zero-Constraint Violations Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal
JAIR 2022 Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm Qinbo Bai, Mridul Agarwal, Vaneet Aggarwal
UAI 2022 Regret Guarantees for Model-Based Reinforcement Learning with Long-Term Average Constraints Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal
AISTATS 2021 Reinforcement Learning for Constrained Markov Decision Processes Ather Gattami, Qinbo Bai, Vaneet Aggarwal