ML Anthology
Authors
Search
About
Huang, Youcheng
1 publications
AAAI
2025
LEGEND: Leveraging Representation Engineering to Annotate Safety Margin for Preference Datasets
Duanyu Feng
,
Bowen Qin
,
Chen Huang
,
Youcheng Huang
,
Zheng Zhang
,
Wenqiang Lei