Yang, Tsung-Huan

1 publication

NeurIPSW 2024. Preserving Safety in Fine-Tuned Large Language Models: A Systematic Evaluation and Mitigation Strategy. Tsung-Huan Yang, Ko-Wei Huang, Yung-Hui Li, Lun-Wei Ku