Li, Zhuohan
10 publications
ICML
2020
Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
ICLR
2020
Understanding and Improving Transformer from a Multi-Particle Dynamic System Point of View
10 publications