Dang, Fan

2 publications

ICLR 2026 DynamicInfer: Runtime-Aware Sparse Offloading for LLMs Inference on a Consumer-Grade GPU Zhui Zhu, Weichen Zhang, Zhenghan Zhou, Yunhao Liu, Fan Dang
CVPR 2025 SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity Ke Ma, Jiaqi Tang, Bin Guo, Fan Dang, Sicong Liu, Zhui Zhu, Lei Wu, Cheng Fang, Ying-Cong Chen, Zhiwen Yu, Yunhao Liu