Wu, Xiaoxia
16 publications
ICLR
2026
CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention
AAAI
2024
Exploring Post-Training Quantization in LLMs from Comprehensive Study to Low Rank Compensation
NeurIPS
2024
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
NeurIPSW
2023
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery Through Sophisticated AI System Technologies