Zhou, Zhenli

1 publication

NeurIPS 2024 MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected Layers Ning Ding, Yehui Tang, Haochen Qin, Zhenli Zhou, Chao Xu, Lin Li, Kai Han, Heng Liao, Yunhe Wang