ML Anthology
Authors
Search
About
Xu, Silei
2 publications
ICLR
2025
Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Tong Wu
,
Shujian Zhang
,
Kaiqiang Song
,
Silei Xu
,
Sanqiang Zhao
,
Ravi Agrawal
,
Sathish Reddy Indurthi
,
Chong Xiang
,
Prateek Mittal
,
Wenxuan Zhou
NeurIPSW
2024
Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Tong Wu
,
Shujian Zhang
,
Kaiqiang Song
,
Silei Xu
,
Sanqiang Zhao
,
Ravi Agrawal
,
Sathish Reddy Indurthi
,
Chong Xiang
,
Prateek Mittal
,
Wenxuan Zhou