ML Anthology
Yao, Jinwei
3 publications
ICLR
2025
DeFT: Decoding with Flash Tree-Attention for Efficient Tree-Structured LLM Inference
Jinwei Yao, Kaiqi Chen, Kexun Zhang, Jiaxuan You, Binhang Yuan, Zeke Wang, Tao Lin
ICML
2025
ResearchTown: Simulator of Human Research Community
Haofei Yu, Zhaochen Hong, Zirui Cheng, Kunlun Zhu, Keyang Xuan, Jinwei Yao, Tao Feng, Jiaxuan You
ICLRW
2024
DeFT: Flash Tree-Attention with IO-Awareness for Efficient Tree-Search-Based LLM Inference
Jinwei Yao, Kexun Zhang, Kaiqi Chen, Jiaxuan You, Zeke Wang, Binhang Yuan, Tao Lin