Park, Yeonhong

7 publications

ICLR 2026. Libra: Effective yet Efficient Load Balancing for Large-Scale MoE Inference. Jaehoon Yang, Yushin Kim, Seokwon Moon, Yeonhong Park, Jae W. Lee

ICLR 2026. SpareTrain: Fault-Tolerant LLM Training via Low-Cost Dual Modular Redundancy. Rihae Park, Yeonjae Kim, Seung Yul Lee, Yeonhong Park, Jae W. Lee

NeurIPS 2025. DP-LLM: Runtime Model Adaptation with Dynamic Layer-Wise Precision Assignment. Sangwoo Kwon, Seong Hoon Seo, Jae W. Lee, Yeonhong Park

ICML 2025. FlashTP: Fused, Sparsity-Aware Tensor Product for Machine Learning Interatomic Potentials. Seung Yul Lee, Hojoon Kim, Yutack Park, Dawoon Jeong, Seungwu Han, Yeonhong Park, Jae W. Lee

ICML 2025. GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance. Jinuk Kim, Marwa El Halabi, Wonpyo Park, Clemens Js Schaefer, Deokjae Lee, Yeonhong Park, Jae W. Lee, Hyun Oh Song

NeurIPS 2025. NestedFP: High-Performance, Memory-Efficient Dual-Precision Floating Point Support for LLMs. Haeun Lee, Omin Kwon, Yeonhong Park, Jae W. Lee

ICML 2024. Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs. Yeonhong Park, Jake Hyun, Sanglyul Cho, Bonggeun Sim, Jae W. Lee