Xu, Silei

2 publications

ICLR 2025. Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy. Tong Wu, Shujian Zhang, Kaiqiang Song, Silei Xu, Sanqiang Zhao, Ravi Agrawal, Sathish Reddy Indurthi, Chong Xiang, Prateek Mittal, Wenxuan Zhou
NeurIPSW 2024. Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy. Tong Wu, Shujian Zhang, Kaiqiang Song, Silei Xu, Sanqiang Zhao, Ravi Agrawal, Sathish Reddy Indurthi, Chong Xiang, Prateek Mittal, Wenxuan Zhou