Weisz, Gellért

13 publications

NeurIPS 2024 Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^\pi$-Realizability and Concentrability Volodymyr Tkachuk, Gellért Weisz, Csaba Szepesvári
COLT 2023 Exponential Hardness of Reinforcement Learning with Linear Function Approximation Sihan Liu, Gaurav Mahajan, Daniel Kane, Shachar Lovett, Gellért Weisz, Csaba Szepesvári
NeurIPS 2023 Online RL in Linearly $q^\pi$-Realizable MDPs Is as Easy as in Linear MDPs if You Learn What to Ignore Gellért Weisz, András György, Csaba Szepesvári
NeurIPS 2023 Optimistic Natural Policy Gradient: A Simple Efficient Policy Optimization Framework for Online RL Qinghua Liu, Gellért Weisz, András György, Chi Jin, Csaba Szepesvári
NeurIPS 2022 Confident Approximate Policy Iteration for Efficient Local Planning in $q^\pi$-Realizable MDPs Gellért Weisz, András György, Tadashi Kozuno, Csaba Szepesvári
ALT 2022 TensorPlan and the Few Actions Lower Bound for Planning in MDPs Under Linear Realizability of Optimal Value Functions Gellért Weisz, Csaba Szepesvári, András György
ALT 2021 Exponential Lower Bounds for Planning in MDPs with Linearly-Realizable Optimal Action-Value Functions Gellért Weisz, Philip Amortila, Csaba Szepesvári
COLT 2021 On Query-Efficient Planning in MDPs Under Linear Realizability of the Optimal State-Value Function Gellért Weisz, Philip Amortila, Barnabás Janzer, Yasin Abbasi-Yadkori, Nan Jiang, Csaba Szepesvári
NeurIPS 2020 ImpatientCapsAndRuns: Approximately Optimal Algorithm Configuration from an Infinite Pool Gellért Weisz, András György, Wei-I Lin, Devon Graham, Kevin Leyton-Brown, Csaba Szepesvári, Brendan Lucier
ICML 2020 Learning with Good Feature Representations in Bandits and in RL with a Generative Model Tor Lattimore, Csaba Szepesvári, Gellért Weisz
ICML 2019 CapsAndRuns: An Improved Method for Approximately Optimal Algorithm Configuration Gellért Weisz, András György, Csaba Szepesvári
ICML 2019 POLITEX: Regret Bounds for Policy Iteration Using Expert Prediction Yasin Abbasi-Yadkori, Peter Bartlett, Kush Bhatia, Nevena Lazic, Csaba Szepesvári, Gellért Weisz
ICML 2018 LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration Gellért Weisz, András György, Csaba Szepesvári