He, Zhengfu

3 publications

ICLR 2025 Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures Junxuan Wang, Xuyang Ge, Wentao Shu, Qiong Tang, Yunhua Zhou, Zhengfu He, Xipeng Qiu
ICMLW 2024 Automatically Identifying Local and Global Circuits with Linear Computation Graphs Xuyang Ge, Fukang Zhu, Wentao Shu, Junxuan Wang, Zhengfu He, Xipeng Qiu
ICML 2024 Can AI Assistants Know What They Don’t Know? Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, Wenwei Zhang, Zhangyue Yin, Shimin Li, Linyang Li, Zhengfu He, Kai Chen, Xipeng Qiu