Tian, Boyu

1 publications

NeurIPS 2025 Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning Chaofan Lin, Jiaming Tang, Shuo Yang, Hanshuo Wang, Tian Tang, Boyu Tian, Ion Stoica, Song Han, Mingyu Gao