Ozdaglar, Asuman
24 publications
NeurIPS
2023
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games
AISTATS
2023
Symmetric (Optimistic) Natural Policy Gradient for Multi-Agent Learning with Parameter Convergence
NeurIPS
2023
Time-Reversed Dissipation Induces Duality Between Minimizing Gradient Norm and Function Value
COLT
2020
Last Iterate Is Slower than Averaged Iterate in Smooth Convex-Concave Saddle Point Problems