Wu, Jingfeng
31 publications
ICML
2025
Gradient Descent Converges Arbitrarily Fast for Logistic Regression via Large and Adaptive Stepsizes
NeurIPS
2024
In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization
NeurIPS
2024
Large Stepsize Gradient Descent for Non-Homogeneous Two-Layer Networks: Margin Improvement and Fast Optimization
ICML
2022
Last Iterate Risk Bounds of SGD with Decaying Stepsize for Overparameterized Linear Regression
NeurIPS
2022
The Power and Limitation of Pretraining-Finetuning for Linear Regression Under Covariate Shift
NeurIPS
2021
Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning