Yang, Zhuoran
143 publications
ICCV
2025
InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation
CoRL
2025
Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation
AISTATS
2025
What and How Does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization
NeurIPSW
2024
Can Neural Networks Achieve Optimal Computational-Statistical Tradeoff? An Analysis on Single-Index Model
ICML
2024
From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems
ICML
2024
Mean Field Langevin Actor-Critic: Faster Convergence and Global Optimality Beyond Lazy Learning
NeurIPS
2024
On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games
ICMLW
2024
STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making
NeurIPS
2024
Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers
ICMLW
2024
Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers
ICLR
2023
Decentralized Optimistic Hyperpolicy Mirror Descent: Provably No-Regret Learning in Markov Games
NeurIPS
2023
Diffusion Model Is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
JMLR
2023
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
AISTATS
2023
Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models via Reinforcement Learning
ICML
2023
Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model
NeurIPS
2023
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
NeurIPS
2023
Online Performative Gradient Descent for Learning Nash Equilibria in Decision-Dependent Games
NeurIPS
2022
Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence
ICML
2022
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets
ICLRW
2022
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets
ICML
2022
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes
ICLR
2022
Reinforcement Learning Under a Multi-Agent Predictive State Representation Model: Method and Theory
NeurIPS
2022
Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
ICML
2022
Welfare Maximization in Competitive Equilibrium: Reinforcement Learning for Markov Exchange Economy
AISTATS
2021
Provably Efficient Actor-Critic for Risk-Sensitive and Robust Adversarial RL: A Linear-Quadratic Case
NeurIPSW
2021
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning
NeurIPS
2021
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
NeurIPS
2021
Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration
ICML
2021
On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game
ICML
2021
Randomized Exploration in Reinforcement Learning with General Value Function Approximation
NeurIPS
2021
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic
NeurIPS
2020
Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach
NeurIPS
2020
Provably Efficient Reinforcement Learning with Kernel and Neural Function Approximations
NeurIPS
2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
NeurIPS
2019
Provably Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
NeurIPSW
2019
Robust One-Bit Recovery via ReLU Generative Networks: Improved Statistical Rate and Global Landscape Analysis
AISTATS
2018
Nonlinear Structured Signal Estimation in High Dimensions via Iterative Hard Thresholding
ICML
2017
High-Dimensional Non-Gaussian Single Index Models via Thresholded Score Function Estimation