Liu, Boyi
17 publications
NeurIPS
2024
Provably Mitigating Overoptimization in RLHF: Your SFT Loss Is Implicitly an Adversarial Regularizer
ICMLW
2024
Provably Mitigating Overoptimization in RLHF: Your SFT Loss Is Implicitly an Adversarial Regularizer
JMLR
2023
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
NeurIPS
2023
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
IJCAI
2022
Dynamic Graph Learning Based on Hierarchical Memory for Origin-Destination Demand Prediction
NeurIPS
2022
Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence