Lu, Miao
16 publications
NeurIPSW
2024
Can Neural Networks Achieve Optimal Computational-Statistical Tradeoff? an Analysis on Single-Index Model
NeurIPS
2024
Provably Mitigating Overoptimization in RLHF: Your SFT Loss Is Implicitly an Adversarial Regularizer
ICMLW
2024
Provably Mitigating Overoptimization in RLHF: Your SFT Loss Is Implicitly an Adversarial Regularizer
NeurIPS
2023
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
ICLR
2022
Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining