ML Anthology
Zhang, Yunan
6 publications
ICLR
2025
A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts
Suyu Ge, Xihui Lin, Yunan Zhang, Jiawei Han, Hao Peng
ICLRW
2025
S2-Attention: Hardware-Aware Context Sharding Among Attention Heads
Xihui Lin, Yunan Zhang, Suyu Ge, Liliang Ren, Barun Patra, Vishrav Chaudhary, Hao Peng, Xia Song
ICLR
2024
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao
AAAI
2023
A Neural Span-Based Continual Named Entity Recognition Model
Yunan Zhang, Qingcai Chen
NeurIPSW
2023
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao
AAAI
2022
Unifying Model Explainability and Robustness for Joint Text Classification and Rationale Extraction
Dongfang Li, Baotian Hu, Qingcai Chen, Tujie Xu, Jingcong Tao, Yunan Zhang