Jo, Hyunjik

2 publications

IJCAI 2025. Accurate Sublayer Pruning for Large Language Models by Exploiting Latency and Tunability Information. Seungcheol Park, Sojin Lee, Jongjin Kim, Jinsik Lee, Hyunjik Jo, U Kang.
NeurIPS 2024. Block Transformer: Global-to-Local Language Modeling for Fast Inference. Namgyu Ho, Sangmin Bae, Taehyeon Kim, Hyunjik Jo, Yireun Kim, Tal Schuster, Adam Fisch, James Thorne, Se-Young Yun.