ML Anthology
Authors
Search
About
Wu, Diyuan
5 publications
NeurIPS
2025
Attention with Trained Embeddings Provably Selects Important Tokens
Diyuan Wu
,
Aleksandr Shevchenko
,
Samet Oymak
,
Marco Mondelli
ICML
2025
Neural Collapse Beyond the Unconstrained Features Model: Landscape, Dynamics, and Generalization in the Mean-Field Regime
Diyuan Wu
,
Marco Mondelli
NeurIPS
2024
The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information
Diyuan Wu
,
Ionut-Vlad Modoranu
,
Mher Safaryan
,
Denis Kuznedelev
,
Dan Alistarh
TMLR
2023
Mean-Field Analysis for Heavy Ball Methods: Dropout-Stability, Connectivity, and Global Convergence
Diyuan Wu
,
Vyacheslav Kungurtsev
,
Marco Mondelli
NeurIPSW
2022
Mean-Field Analysis for Heavy Ball Methods: Dropout-Stability, Connectivity, and Global Convergence
Diyuan Wu
,
Vyacheslav Kungurtsev
,
Marco Mondelli