Wang, Mengdi
134 publications
ICLR
2025
Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data
CVPR
2025
Immune: Improving Safety Against Jailbreaks in Multi-Modal LLMs via Inference-Time Alignment
NeurIPS
2025
Securing the Language of Life: Inheritable Watermarks from DNA Language Models to Proteins
NeurIPS
2025
Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models
ICMLW
2024
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences
NeurIPS
2024
Nonparametric Classification on Low Dimensional Manifolds Using Overparameterized Convolutional Residual Networks
JMLR
2024
On the Sample Complexity and Metastability of Heavy-Tailed Policy Search in Continuous Control
ICLR
2024
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
AISTATS
2024
Policy Evaluation for Reinforcement Learning from Human Feedback: A Sample Complexity Analysis
JMLR
2023
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
NeurIPS
2023
Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations
JMLR
2023
Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition
NeurIPSW
2023
Nonparametric Classification on Low Dimensional Manifolds Using Overparameterized Convolutional Residual Networks
ICLR
2023
Offline Reinforcement Learning with Differentiable Function Approximation Is Provably Efficient
NeurIPS
2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
NeurIPS
2023
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement
NeurIPS
2022
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization
ICML
2022
Efficient Reinforcement Learning in Block MDPs: A Model-Free Representation Learning Approach
L4DC
2020
A Duality Approach for Regret Minimization in Average-Award Ergodic Markov Decision Processes
NeurIPS
2020
Provably Efficient Reinforcement Learning with Kernel and Neural Function Approximations
AISTATS
2020
Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity
NeurIPS
2019
Learning Low-Dimensional State Embeddings and Metastable Clusters from Time Series Data
NeurIPS
2018
Dimensionality Reduction for Stationary Time Series via Stochastic Nonconvex Optimization
NeurIPS
2018
Near-Optimal Time and Sample Complexities for Solving Markov Decision Processes with a Generative Model