Cui, Xiaodong
17 publications
NeurIPSW
2023
Transformers as Multi-Task Feature Selectors: Generalization Analysis of In-Context Learning
NeurIPS
2022
A Stochastic Linearized Augmented Lagrangian Method for Decentralized Bilevel Optimization
NeurIPS
2020
ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training