Carvalho, Diogo S.

2 publications

TMLR 2025 Multi-Bellman Operator for Convergence of $q$-Learning with Linear Function Approximation Diogo S. Carvalho, Pedro A. Santos, Francisco S. Melo
MLJ 2024 The Impact of Data Distribution on Q-Learning with Function Approximation Pedro P. Santos, Diogo S. Carvalho, Alberto Sardinha, Francisco S. Melo