Wu, Guoqiang
15 publications
NeurIPS
2024
Lower Bounds of Uniform Stability in Gradient-Based Bilevel Algorithms for Hyperparameter Optimization
NeurIPS
2024
On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability