Du, Yihan

14 publications

ICML 2025 Reinforcement Learning with Segment Feedback Yihan Du, Anna Winnicki, Gal Dalal, Shie Mannor, R. Srikant
ICLR 2024 Cascading Reinforcement Learning Yihan Du, R. Srikant, Wei Chen
ICML 2024 Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization Yihan Du, Anna Winnicki, Gal Dalal, Shie Mannor, R. Srikant
ICLR 2024 Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback Yu Chen, Yihan Du, Pihe Hu, Siwei Wang, Desheng Wu, Longbo Huang
ICLR 2023 Collaborative Pure Exploration in Kernel Bandit Yihan Du, Wei Chen, Yuko Kuroki, Longbo Huang
ICML 2023 Multi-Task Representation Learning for Pure Exploration in Linear Bandits Yihan Du, Longbo Huang, Wen Sun
ICLR 2023 Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path Yihan Du, Siwei Wang, Longbo Huang
NeurIPS 2023 Provably Safe Reinforcement Learning with Step-Wise Violation Constraints Nuoya Xiong, Yihan Du, Longbo Huang
ICML 2022 Branching Reinforcement Learning Yihan Du, Wei Chen
AAAI 2021 A One-Size-Fits-All Solution to Conservative Bandit Problems Yihan Du, Siwei Wang, Longbo Huang
NeurIPS 2021 Combinatorial Pure Exploration with Bottleneck Reward Function Yihan Du, Yuko Kuroki, Wei Chen
AAAI 2021 Combinatorial Pure Exploration with Full-Bandit or Partial Linear Feedback Yihan Du, Yuko Kuroki, Wei Chen
NeurIPS 2021 Continuous Mean-Covariance Bandits Yihan Du, Siwei Wang, Zhixuan Fang, Longbo Huang
ICML 2020 Combinatorial Pure Exploration for Dueling Bandit Wei Chen, Yihan Du, Longbo Huang, Haoyu Zhao