Ge, Suyu

5 publications

ICLR 2025 A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts Suyu Ge, Xihui Lin, Yunan Zhang, Jiawei Han, Hao Peng
ICLRW 2025 S2-Attention: Hardware-Aware Context Sharding Among Attention Heads Xihui Lin, Yunan Zhang, Suyu Ge, Liliang Ren, Barun Patra, Vishrav Chaudhary, Hao Peng, Xia Song
ICLR 2024 Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao
ECML-PKDD 2023 Corpus-Based Relation Extraction by Identifying and Refining Relation Patterns Sizhe Zhou, Suyu Ge, Jiaming Shen, Jiawei Han
NeurIPSW 2023 Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao