Weisz, Gellért
13 publications
NeurIPS
2023
Online RL in Linearly $q^\pi$-Realizable MDPs Is as Easy as in Linear MDPs if You Learn What to Ignore
NeurIPS
2023
Optimistic Natural Policy Gradient: A Simple Efficient Policy Optimization Framework for Online RL
NeurIPS
2022
Confident Approximate Policy Iteration for Efficient Local Planning in $q^\pi$-Realizable MDPs
COLT
2021
On Query-Efficient Planning in MDPs Under Linear Realizability of the Optimal State-Value Function