Duan, Yaqi

13 publications

ICML 2025 PILAF: Optimal Human Preference Sampling for Reward Modeling Yunzhen Feng, Ariel Kwiatkowski, Kunhao Zheng, Julia Kempe, Yaqi Duan
ICLRW 2025 PILAF: Optimal Human Preference Sampling for Reward Modeling Yunzhen Feng, Ariel Kwiatkowski, Kunhao Zheng, Julia Kempe, Yaqi Duan
NeurIPS 2024 Taming "data-Hungry" Reinforcement Learning? Stability in Continuous State-Action Spaces Yaqi Duan, Martin J. Wainwright
L4DC 2023 A Finite-Sample Analysis of Multi-Step Temporal Difference Estimates Yaqi Duan, Martin J. Wainwright
IJCAI 2023 Invertible Residual Neural Networks with Conditional Injector and Interpolator for Point Cloud Upsampling Aihua Mao, Yaqi Duan, Yu-Hui Wen, Zihui Du, Hongmin Cai, Yong-Jin Liu
JMLR 2023 Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition Chengzhuo Ni, Yaqi Duan, Munther Dahleh, Mengdi Wang, Anru R. Zhang
ICLR 2022 Near-Optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang
ICML 2021 Bootstrapping Fitted Q-Evaluation for Off-Policy Inference Botao Hao, Xiang Ji, Yaqi Duan, Hao Lu, Csaba Szepesvari, Mengdi Wang
ICML 2021 Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning Yaqi Duan, Chi Jin, Zhiyuan Li
ICML 2021 Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient Botao Hao, Yaqi Duan, Tor Lattimore, Csaba Szepesvari, Mengdi Wang
ICML 2020 Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation Yaqi Duan, Zeyu Jia, Mengdi Wang
NeurIPS 2019 Learning Low-Dimensional State Embeddings and Metastable Clusters from Time Series Data Yifan Sun, Yaqi Duan, Hao Gong, Mengdi Wang
NeurIPS 2019 State Aggregation Learning from Markov Transition Data Yaqi Duan, Tracy Ke, Mengdi Wang