Zhang, Yunan

6 publications

ICLR 2025. A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts. Suyu Ge, Xihui Lin, Yunan Zhang, Jiawei Han, Hao Peng.
ICLRW 2025. S2-Attention: Hardware-Aware Context Sharding Among Attention Heads. Xihui Lin, Yunan Zhang, Suyu Ge, Liliang Ren, Barun Patra, Vishrav Chaudhary, Hao Peng, Xia Song.
ICLR 2024. Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs. Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao.
AAAI 2023. A Neural Span-Based Continual Named Entity Recognition Model. Yunan Zhang, Qingcai Chen.
NeurIPSW 2023. Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs. Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao.
AAAI 2022. Unifying Model Explainability and Robustness for Joint Text Classification and Rationale Extraction. Dongfang Li, Baotian Hu, Qingcai Chen, Tujie Xu, Jingcong Tao, Yunan Zhang.