ML Anthology
Authors
Search
About
Yang, Tsung-Huan
1 publications
NeurIPSW
2024
Preserving Safety in Fine-Tuned Large Language Models: A Systematic Evaluation and Mitigation Strategy
Tsung-Huan Yang
,
Ko-Wei Huang
,
Yung-Hui Li
,
Lun-Wei Ku