Kwon, Se Jung
13 publications
NeurIPS
2024
DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation
ICML
2023
FlexRound: Learnable Rounding Based on Element-Wise Division for Post-Training Quantization
NeurIPS
2023
Memory-Efficient Fine-Tuning of Compressed Large Language Models via Sub-4-Bit Integer Quantization