Lu, Miao

16 publications

ICLR 2025 Can Neural Networks Achieve Optimal Computational-Statistical Tradeoff? an Analysis on Single-Index Model Siyu Chen, Beining Wu, Miao Lu, Zhuoran Yang, Tianhao Wang
ICLR 2024 Benign Oscillation of Stochastic Gradient Descent with Large Learning Rate Miao Lu, Beining Wu, Xiaodong Yang, Difan Zou
NeurIPSW 2024 Can Neural Networks Achieve Optimal Computational-Statistical Tradeoff? an Analysis on Single-Index Model Siyu Chen, Beining Wu, Miao Lu, Zhuoran Yang, Tianhao Wang
ICMLW 2024 Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm Miao Lu, Han Zhong, Tong Zhang, Jose Blanchet
NeurIPS 2024 Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithms Miao Lu, Han Zhong, Tong Zhang, Jose Blanchet
NeurIPS 2024 Provably Mitigating Overoptimization in RLHF: Your SFT Loss Is Implicitly an Adversarial Regularizer Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose Blanchet, Zhaoran Wang
ICMLW 2024 Provably Mitigating Overoptimization in RLHF: Your SFT Loss Is Implicitly an Adversarial Regularizer Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose Blanchet, Zhaoran Wang
NeurIPSW 2023 Benign Oscillation of Stochastic Gradient Descent with Large Learning Rate Miao Lu, Beining Wu, Xiaodong Yang, Difan Zou
NeurIPS 2023 Double Pessimism Is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage Jose Blanchet, Miao Lu, Tong Zhang, Han Zhong
NeurIPS 2023 Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang
ICLR 2023 Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes Miao Lu, Yifei Min, Zhaoran Wang, Zhuoran Yang
CVPR 2022 GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection Yue Liao, Aixi Zhang, Miao Lu, Yongliang Wang, Xiaobo Li, Si Liu
ICLR 2022 Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining Miao Lu, Xiaolong Luo, Tianlong Chen, Wuyang Chen, Dong Liu, Zhangyang Wang
AAAI 2022 Learning Robust Policy Against Disturbance in Transition Dynamics via State-Conservative Policy Optimization Yufei Kuang, Miao Lu, Jie Wang, Qi Zhou, Bin Li, Houqiang Li
ICML 2022 Welfare Maximization in Competitive Equilibrium: Reinforcement Learning for Markov Exchange Economy Zhihan Liu, Miao Lu, Zhaoran Wang, Michael Jordan, Zhuoran Yang
NeurIPS 2021 Mining the Benefits of Two-Stage and One-Stage HOI Detection Aixi Zhang, Yue Liao, Si Liu, Miao Lu, Yongliang Wang, Chen Gao, Xiaobo Li