Papini, Matteo
29 publications
NeurIPS
2024
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
NeurIPS
2022
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits