Li, Conglong
8 publications
NeurIPSW
2023
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery Through Sophisticated AI System Technologies
ICML
2022
DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale
NeurIPS
2022
The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup for Training GPT Models