Yang, Wenkai

9 publications

NeurIPS 2025. Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning. Yiju Guo, Wenkai Yang, Zexu Sun, Ning Ding, Zhiyuan Liu, Yankai Lin
ICLR 2025. Super(ficial)-Alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization. Wenkai Yang, Shiqi Shen, Guangyao Shen, Wei Yao, Yong Liu, Gong Zhi, Yankai Lin, Ji-Rong Wen
NeurIPS 2025. Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning. Wenkai Yang, Shuming Ma, Yankai Lin, Furu Wei
ICLRW 2025. Understanding the Capabilities and Limitations of Weak-to-Strong Generalization. Wei Yao, Wenkai Yang, Ziqiao Wang, Yankai Lin, Yong Liu
TMLR 2024. Decentralized Decoupled Training for Federated Long-Tailed Learning. Wenkai Yang, Deli Chen, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun
ICLR 2024. Towards Codable Watermarking for Injecting Multi-Bits Information to LLMs. Lean Wang, Wenkai Yang, Deli Chen, Hao Zhou, Yankai Lin, Fandong Meng, Jie Zhou, Xu Sun
NeurIPS 2024. Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents. Wenkai Yang, Xiaohan Bi, Yankai Lin, Sishuo Chen, Jie Zhou, Xu Sun
TMLR 2023. When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning. Wenkai Yang, Yankai Lin, Guangxiang Zhao, Peng Li, Jie Zhou, Xu Sun
AAAI 2022. Well-Classified Examples Are Underestimated in Classification with Deep Neural Networks. Guangxiang Zhao, Wenkai Yang, Xuancheng Ren, Lei Li, Yunfang Wu, Xu Sun