Han, Xiaomeng

2 publications

ICLR 2026 NLI : Non-Uniform Linear Interpolation Approximation of Nonlinear Operations for Efficient LLMs Inference Jiangyong Yu, Xiaomeng Han, Xing Hu, Chen Xu, Zhe Jiang, Dawei Yang
AAAI 2025 Pushing the Limits of BFP on Narrow Precision LLM Inference Hui Wang, Yuan Cheng, Xiaomeng Han, Zhengpeng Zhao, Dawei Yang, Zhe Jiang