Wu, Diyuan

5 publications

NeurIPS 2025. Attention with Trained Embeddings Provably Selects Important Tokens. Diyuan Wu, Aleksandr Shevchenko, Samet Oymak, Marco Mondelli.
ICML 2025. Neural Collapse Beyond the Unconstrained Features Model: Landscape, Dynamics, and Generalization in the Mean-Field Regime. Diyuan Wu, Marco Mondelli.
NeurIPS 2024. The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information. Diyuan Wu, Ionut-Vlad Modoranu, Mher Safaryan, Denis Kuznedelev, Dan Alistarh.
TMLR 2023. Mean-Field Analysis for Heavy Ball Methods: Dropout-Stability, Connectivity, and Global Convergence. Diyuan Wu, Vyacheslav Kungurtsev, Marco Mondelli.
NeurIPSW 2022. Mean-Field Analysis for Heavy Ball Methods: Dropout-Stability, Connectivity, and Global Convergence. Diyuan Wu, Vyacheslav Kungurtsev, Marco Mondelli.