Cam-Tu, Nguyen

1 publications

NeurIPS 2025 SmallKV: Small Model Assisted Compensation of KV Cache Compression for Efficient LLM Inference Yi Zhao, Yajuan Peng, Nguyen Cam-Tu, Zuchao Li, Wang Xiaoliang, Hai Zhao, Xiaoming Fu