Kim, Jae-Joon
13 publications
NeurIPS
2025
Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning
NeurIPS
2024
Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models
ICML
2024
SLEB: Streamlining LLMs Through Redundancy Verification and Elimination of Transformer Blocks
NeurIPS
2023
Leveraging Early-Stage Robustness in Diffusion Models for Efficient and High-Quality Image Synthesis
ICLR
2023
Winning Both the Accuracy of Floating Point Activation and the Simplicity of Integer Arithmetic