Sotnikov, Dmitry

2 publications

ICML 2024. Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback. Asaf Cassel, Haipeng Luo, Aviv Rosenberg, Dmitry Sotnikov.
ICML 2023. Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback. Tal Lancewicki, Aviv Rosenberg, Dmitry Sotnikov.