Di, Qiwei

7 publications

ICML 2025. Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback. Qiwei Di, Jiafan He, Quanquan Gu
ICLR 2025. Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers. Runjia Li, Qiwei Di, Quanquan Gu
ICML 2024. Borda Regret Minimization for Generalized Linear Dueling Bandits. Yue Wu, Tao Jin, Qiwei Di, Hao Lou, Farzad Farnoud, Quanquan Gu
ICLR 2024. Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning. Qiwei Di, Heyang Zhao, Jiafan He, Quanquan Gu
ICLR 2024. Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits. Qiwei Di, Tao Jin, Yue Wu, Heyang Zhao, Farzad Farnoud, Quanquan Gu
ICMLW 2023. Borda Regret Minimization for Generalized Linear Dueling Bandits. Yue Wu, Tao Jin, Qiwei Di, Hao Lou, Farzad Farnoud, Quanquan Gu
ICML 2023. Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path. Qiwei Di, Jiafan He, Dongruo Zhou, Quanquan Gu