ML Anthology
Authors
Search
About
Wang, Yuanhao
17 publications
ICML
2025
Securing Equal Share: A Principled Approach for Learning Multiplayer Symmetric Games
Jiawei Ge
,
Yuanhao Wang
,
Wenzhe Li
,
Chi Jin
NeurIPS
2024
Directional Smoothness and Gradient Methods: Convergence and Adaptivity
Aaron Mishkin
,
Ahmed Khaled
,
Yuanhao Wang
,
Aaron Defazio
,
Robert M. Gower
COLT
2023
Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation
Yuanhao Wang
,
Qinghua Liu
,
Yu Bai
,
Chi Jin
NeurIPS
2023
Is RLHF More Difficult than Standard RL? a Theoretical Perspective
Yuanhao Wang
,
Qinghua Liu
,
Chi Jin
NeurIPS
2023
Learning Adaptive Tensorial Density Fields for Clean Cryo-ET Reconstruction
Yuanhao Wang
,
Ramzi Idoughi
,
Wolfgang Heidrich
ICLR
2023
Learning Rationalizable Equilibria in Multiplayer Games
Yuanhao Wang
,
Dingwen Kong
,
Yu Bai
,
Chi Jin
AISTATS
2022
Near-Optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization
Guodong Zhang
,
Yuanhao Wang
,
Laurent Lessard
,
Roger B. Grosse
ICML
2022
Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits
Qinghua Liu
,
Yuanhao Wang
,
Chi Jin
ICLRW
2022
V-Learning -- a Simple, Efficient, Decentralized Algorithm for Multiagent RL
Chi Jin
,
Qinghua Liu
,
Yuanhao Wang
,
Tiancheng Yu
AISTATS
2021
On the Suboptimality of Negative Momentum for Minimax Optimization
Guodong Zhang
,
Yuanhao Wang
NeurIPS
2021
An Exponential Lower Bound for Linearly Realizable MDP with Constant Suboptimality Gap
Yuanhao Wang
,
Ruosong Wang
,
Sham Kakade
ICML
2021
Online Learning in Unknown Markov Games
Yi Tian
,
Yuanhao Wang
,
Tiancheng Yu
,
Suvrit Sra
ICLR
2020
Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication
Yuanhao Wang
,
Jiachen Hu
,
Xiaoyu Chen
,
Liwei Wang
NeurIPS
2020
Improved Algorithms for Convex-Concave Minimax Optimization
Yuanhao Wang
,
Jian Li
ICLR
2020
On Solving Minimax Optimization Locally: A Follow-the-Ridge Approach
Yuanhao Wang
,
Guodong Zhang
,
Jimmy Ba
ICLR
2020
Q-Learning with UCB Exploration Is Sample Efficient for Infinite-Horizon MDP
Kefan Dong
,
Yuanhao Wang
,
Xiaoyu Chen
,
Liwei Wang
ECCV
2020
Stereo Event-Based Particle Tracking Velocimetry for 3D Fluid Flow Reconstruction
Yuanhao Wang
,
Ramzi Idoughi
,
Wolfgang Heidrich