Zhen, Hui-Ling

12 publications

ICLR 2026 Beyond Speedup - Utilizing KV Cache for Sampling and Reasoning Zeyu Xing, Xing Li, Hui-Ling Zhen, Mingxuan Yuan, Sinno Jialin Pan
ICLR 2026 Efficient Reasoning with Balanced Thinking Yulin Li, Tengyao Tu, Li Ding, Junjie Wang, Hui-Ling Zhen, Yixin Chen, Yong Li, Zhuotao Tian
ICLR 2026 MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling Yu Zhang, Hui-Ling Zhen, Mingxuan Yuan, Bei Yu
ICLR 2026 PASER: Post-Training Data Selection for Efficient Pruned Large Language Model Recovery Bowei He, Lihao Yin, Hui-Ling Zhen, Xiaokun Zhang, Mingxuan Yuan, Chen Ma
ICLR 2026 Scaling up, Speeding up: A Benchmark of Speculative Decoding for Efficient LLM Test-Time Scaling Shengyin Sun, Yiming Li, Xing Li, Yingzhao Lian, Weizhe Lin, Hui-Ling Zhen, Zhiyuan Yang, Xianzhi Yu, Chen Chen, Mingxuan Yuan, Chen Ma
ICLR 2026 TrimR: Verifier-Based Training-Free Thinking Trimming for Efficient Test-Time Scaling Weizhe Lin, Xing Li, Zhiyuan Yang, Xiaojin Fu, Hui-Ling Zhen, Yaoyuan Wang, Xianzhi Yu, Wulong Liu, Xiaosong Li, Mingxuan Yuan
ICLR 2026 Why Attention Patterns Exist: A Unifying Temporal Perspective Analysis Qingyue Yang, Jie Wang, Xing Li, Yinqi Bai, Tong Xialiang, Hui-Ling Zhen, Jianye Hao, Mingxuan Yuan, Bin Li
ICML 2025 KVTuner: Sensitivity-Aware Layer-Wise Mixed-Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference Xing Li, Zeyu Xing, Yiming Li, Linping Qu, Hui-Ling Zhen, Yiwu Yao, Wulong Liu, Sinno Jialin Pan, Mingxuan Yuan
IJCAI 2025 The Graph's Apprentice: Teaching an LLM Low-Level Knowledge for Circuit Quality Estimation Reza Moravej, Saurabh Bodhe, Zhanguang Zhang, Didier Chételat, Dimitrios Tsaras, Yingxue Zhang, Hui-Ling Zhen, Jianye Hao, Mingxuan Yuan
NeurIPS 2024 HardCore Generation: Generating Hard UNSAT Problems for Data Augmentation Joseph Cotnareanu, Zhanguang Zhang, Hui-Ling Zhen, Yingxue Zhang, Mark Coates
ECML-PKDD 2022 Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-Based Policy Learning Zeren Huang, Wenhao Chen, Weinan Zhang, Chuhan Shi, Furui Liu, Hui-Ling Zhen, Mingxuan Yuan, Jianye Hao, Yong Yu, Jun Wang
NeurIPS 2019 Pareto Multi-Task Learning Xi Lin, Hui-Ling Zhen, Zhenhua Li, Qing-Fu Zhang, Sam Kwong