Ying, Lei
25 publications
NeurIPS
2025
Achieving $\tilde{\mathcal{O}}(1/N)$ Optimality Gap in Restless Bandits Through Gaussian Approximation
ICLR
2025
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback Without Reward Inference
AAAI
2024
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
NeurIPS
2023
Fast and Regret Optimal Best Arm Identification: Fundamental Limits and Low-Complexity Algorithms
AISTATS
2023
Learning While Scheduling in Multi-Server Systems with Unknown Statistics: MaxWeight with Discounted UCB
AAAI
2022
Batch Active Learning with Graph Neural Networks via Multi-Agent Deep Reinforcement Learning
NeurIPS
2022
Online Convex Optimization with Hard Constraints: Towards the Best of Two Worlds and Beyond
NeurIPS
2021
An Efficient Pessimistic-Optimistic Algorithm for Stochastic Linear Bandits with General Constraints