Jo, Dongwon
4 publications
NeurIPS
2025
Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning
NeurIPS
2024
Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models
4 publications