ML Anthology
Authors
Search
About
Dang, Fan
2 publications
ICLR
2026
DynamicInfer: Runtime-Aware Sparse Offloading for LLMs Inference on a Consumer-Grade GPU
Zhui Zhu
,
Weichen Zhang
,
Zhenghan Zhou
,
Yunhao Liu
,
Fan Dang
CVPR
2025
SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity
Ke Ma
,
Jiaqi Tang
,
Bin Guo
,
Fan Dang
,
Sicong Liu
,
Zhui Zhu
,
Lei Wu
,
Cheng Fang
,
Ying-Cong Chen
,
Zhiwen Yu
,
Yunhao Liu