Yao, Jinwei

3 publications

ICLR 2025. DeFT: Decoding with Flash Tree-Attention for Efficient Tree-Structured LLM Inference. Jinwei Yao, Kaiqi Chen, Kexun Zhang, Jiaxuan You, Binhang Yuan, Zeke Wang, Tao Lin
ICML 2025. ResearchTown: Simulator of Human Research Community. Haofei Yu, Zhaochen Hong, Zirui Cheng, Kunlun Zhu, Keyang Xuan, Jinwei Yao, Tao Feng, Jiaxuan You
ICLRW 2024. DeFT: Flash Tree-Attention with IO-Awareness for Efficient Tree-Search-Based LLM Inference. Jinwei Yao, Kexun Zhang, Kaiqi Chen, Jiaxuan You, Zeke Wang, Binhang Yuan, Tao Lin