ML Anthology
Duan, Shitong
2 publications
ICLR 2024
Denevil: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction Learning
Shitong Duan, Xiaoyuan Yi, Peng Zhang, Tun Lu, Xing Xie, Ning Gu
IJCAI 2024
On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models
Xinpeng Wang, Shitong Duan, Xiaoyuan Yi, Jing Yao, Shanlin Zhou, Zhihua Wei, Peng Zhang, Dongkuan Xu, Maosong Sun, Xing Xie