He, Jiafan
26 publications
NeurIPS
2024
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation
ICML
2023
On the Interplay Between Misspecification and Sub-Optimality Gap in Linear Contextual Bandits
AISTATS
2022
Near-Optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs
NeurIPS
2022
A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits