Sun, Yutao

5 publications

ICLR 2025 Differential Transformer Tianzhu Ye, Li Dong, Yuqing Xia, Yutao Sun, Yi Zhu, Gao Huang, Furu Wei
IJCAI 2025 Horae: A Domain-Agnostic Language for Automated Service Regulation Yutao Sun, Mingshuai Chen, Tiancheng Zhao, Kangjia Zhao, He Li, Jintao Chen, Zhongyi Wang, Liqiang Lu, Xinkui Zhao, Shuiguang Deng, Jianwei Yin
NeurIPS 2024 You Only Cache Once: Decoder-Decoder Architectures for Language Models Yutao Sun, Li Dong, Yi Zhu, Shaohan Huang, Wenhui Wang, Shuming Ma, Quanlu Zhang, Jianyong Wang, Furu Wei
ICLR 2023 Prototypical Calibration for Few-Shot Learning of Language Models Zhixiong Han, Yaru Hao, Li Dong, Yutao Sun, Furu Wei
ICLRW 2023 Why Can GPT Learn In-Context? Language Models Implicitly Perform Gradient Descent as Meta-Optimizers Damai Dai, Yutao Sun, Li Dong, Yaru Hao, Shuming Ma, Zhifang Sui, Furu Wei