Santos, Pedro A.

1 publications

TMLR 2025 Multi-Bellman Operator for Convergence of $q$-Learning with Linear Function Approximation Diogo S. Carvalho, Pedro A. Santos, Francisco S. Melo