Jia, Bin

1 publications

ICLR 2024 AutoChunk: Automated Activation Chunk for Memory-Efficient Deep Learning Inference Xuanlei Zhao, Shenggan Cheng, Guangyang Lu, Haotian Zhou, Bin Jia, Yang You