Fei, Weizhi

1 publications

NeurIPS 2025 Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference Weizhi Fei, Xueyan Niu, Xie Guoqing, Yingqing Liu, Bo Bai, Wei Han