Lee, Sihwa

1 publications

NeurIPS 2023 Token-Scaled Logit Distillation for Ternary Weight Generative Language Models Minsoo Kim, Sihwa Lee, Janghwan Lee, Sukjin Hong, Du-Seong Chang, Wonyong Sung, Jungwook Choi