ML Anthology
Authors
Search
About
Ding, Dongsheng
14 publications
NeurIPS
2025
Alignment of Large Language Models with Constrained Learning
Botong Zhang
,
Shuo Li
,
Ignacio Hounie
,
Osbert Bastani
,
Dongsheng Ding
,
Alejandro Ribeiro
NeurIPS
2025
Composition and Alignment of Diffusion Models Using Constrained Learning
Shervin Khalafi
,
Ignacio Hounie
,
Dongsheng Ding
,
Alejandro Ribeiro
JMLR
2025
Convergence and Sample Complexity of Natural Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
,
Kaiqing Zhang
,
Jiali Duan
,
Tamer Basar
,
Mihailo R. Jovanovic
AAAI
2025
Deterministic Policy Gradient Primal-Dual Methods for Continuous-Space Constrained MDPs
Sergio Rozada
,
Dongsheng Ding
,
Antonio G. Marques
,
Alejandro Ribeiro
NeurIPS
2024
Constrained Diffusion Models via Dual Training
Shervin Khalafi
,
Dongsheng Ding
,
Alejandro Ribeiro
NeurIPS
2024
One-Shot Safety Alignment for Large Language Models via Optimal Dualization
Xinmeng Huang
,
Shuo Li
,
Edgar Dobriban
,
Osbert Bastani
,
Hamed Hassani
,
Dongsheng Ding
ICMLW
2024
One-Shot Safety Alignment for Large Language Models via Optimal Dualization
Xinmeng Huang
,
Shuo Li
,
Edgar Dobriban
,
Osbert Bastani
,
Hamed Hassani
,
Dongsheng Ding
ICMLW
2024
One-Shot Safety Alignment for Large Language Models via Optimal Dualization
Xinmeng Huang
,
Shuo Li
,
Edgar Dobriban
,
Osbert Bastani
,
Hamed Hassani
,
Dongsheng Ding
AISTATS
2024
Resilient Constrained Reinforcement Learning
Dongsheng Ding
,
Zhengyan Huan
,
Alejandro Ribeiro
NeurIPS
2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
,
Chen-Yu Wei
,
Kaiqing Zhang
,
Alejandro Ribeiro
L4DC
2023
Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning
Dongsheng Ding
,
Xiaohan Wei
,
Zhuoran Yang
,
Zhaoran Wang
,
Mihailo Jovanovic
ICML
2022
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence
Dongsheng Ding
,
Chen-Yu Wei
,
Kaiqing Zhang
,
Mihailo Jovanovic
AISTATS
2021
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Dongsheng Ding
,
Xiaohan Wei
,
Zhuoran Yang
,
Zhaoran Wang
,
Mihailo Jovanovic
NeurIPS
2020
Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes
Dongsheng Ding
,
Kaiqing Zhang
,
Tamer Basar
,
Mihailo Jovanovic