Li, Shengping

2 publications

ICML 2025 MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections Da Xiao, Qingye Meng, Shengping Li, Xingyuan Yuan
ICML 2024 Improving Transformers with Dynamically Composable Multi-Head Attention Da Xiao, Qingye Meng, Shengping Li, Xingyuan Yuan