Wu, Xiaoxia
15 publications
AAAI
2024
Exploring Post-Training Quantization in LLMs from Comprehensive Study to Low Rank Compensation
NeurIPS
2024
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
NeurIPSW
2023
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery Through Sophisticated AI System Technologies