Zhang, Yufeng
18 publications
AISTATS
2025
What and How Does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization
NeurIPS
2023
On the Properties of Kullback-Leibler Divergence Between Multivariate Gaussian Distributions
ICML
2022
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes
AISTATS
2021
Provably Efficient Actor-Critic for Risk-Sensitive and Robust Adversarial RL: A Linear-Quadratic Case
NeurIPS
2021
Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration