Liu, Qinghua
19 publications
ICML
2025
Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
NeurIPS
2023
Optimistic Natural Policy Gradient: A Simple Efficient Policy Optimization Framework for Online RL
ICML
2022
Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits