Huang, Youcheng

1 publication

AAAI 2025. LEGEND: Leveraging Representation Engineering to Annotate Safety Margin for Preference Datasets. Duanyu Feng, Bowen Qin, Chen Huang, Youcheng Huang, Zheng Zhang, Wenqiang Lei