Hong, Yinrong

1 publication

ICLR 2026. Inference-Cost-Aware Dynamic Tree Construction for Efficient Inference in Large Language Models. Yinrong Hong, Zhiquan Tan, Kai Hu.