He, Yuxiong
19 publications
ICLR
2025
ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments
AAAI
2024
Exploring Post-Training Quantization in LLMs from Comprehensive Study to Low Rank Compensation
NeurIPSW
2023
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery Through Sophisticated AI System Technologies
AAAI
2022
Adversarial Data Augmentation for Task-Specific Knowledge Distillation of Pre-Trained Transformers
ICML
2022
DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale
NeurIPS
2022
The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup for Training GPT Models
NeurIPS
2022
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
NeurIPS
2021
NxMTransformer: Semi-Structured Sparsification for Natural Language Understanding via ADMM
NeurIPS
2021
SimiGrad: Fine-Grained Adaptive Batching for Large Scale Training Using Gradient Similarity Measurement