ML Anthology
Authors
Search
About
Du, Yihan
14 publications
ICML
2025
Reinforcement Learning with Segment Feedback
Yihan Du
,
Anna Winnicki
,
Gal Dalal
,
Shie Mannor
,
R. Srikant
ICLR
2024
Cascading Reinforcement Learning
Yihan Du
,
R. Srikant
,
Wei Chen
ICML
2024
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Yihan Du
,
Anna Winnicki
,
Gal Dalal
,
Shie Mannor
,
R. Srikant
ICLR
2024
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback
Yu Chen
,
Yihan Du
,
Pihe Hu
,
Siwei Wang
,
Desheng Wu
,
Longbo Huang
ICLR
2023
Collaborative Pure Exploration in Kernel Bandit
Yihan Du
,
Wei Chen
,
Yuko Kuroki
,
Longbo Huang
ICML
2023
Multi-Task Representation Learning for Pure Exploration in Linear Bandits
Yihan Du
,
Longbo Huang
,
Wen Sun
ICLR
2023
Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path
Yihan Du
,
Siwei Wang
,
Longbo Huang
NeurIPS
2023
Provably Safe Reinforcement Learning with Step-Wise Violation Constraints
Nuoya Xiong
,
Yihan Du
,
Longbo Huang
ICML
2022
Branching Reinforcement Learning
Yihan Du
,
Wei Chen
AAAI
2021
A One-Size-Fits-All Solution to Conservative Bandit Problems
Yihan Du
,
Siwei Wang
,
Longbo Huang
NeurIPS
2021
Combinatorial Pure Exploration with Bottleneck Reward Function
Yihan Du
,
Yuko Kuroki
,
Wei Chen
AAAI
2021
Combinatorial Pure Exploration with Full-Bandit or Partial Linear Feedback
Yihan Du
,
Yuko Kuroki
,
Wei Chen
NeurIPS
2021
Continuous Mean-Covariance Bandits
Yihan Du
,
Siwei Wang
,
Zhixuan Fang
,
Longbo Huang
ICML
2020
Combinatorial Pure Exploration for Dueling Bandit
Wei Chen
,
Yihan Du
,
Longbo Huang
,
Haoyu Zhao