D'Angelo, Francesco

6 publications

ICLR 2025 Selective Induction Heads: How Transformers Select Causal Structures in Context Francesco D'Angelo, Francesco Croce, Nicolas Flammarion
NeurIPS 2025 The Emergence of Sparse Attention: Impact of Data Distribution and Benefits of Repetition Nicolas Zucchet, Francesco D'Angelo, Andrew Kyle Lampinen, Stephanie C.Y. Chan
NeurIPS 2024 Why Do We Need Weight Decay in Modern Deep Learning? Francesco D'Angelo, Maksym Andriushchenko, Aditya Varre, Nicolas Flammarion
NeurIPSW 2023 Why Do We Need Weight Decay for Overparameterized Deep Networks? Francesco D'Angelo, Aditya Varre, Maksym Andriushchenko, Nicolas Flammarion
NeurIPS 2021 Posterior Meta-Replay for Continual Learning Christian Henning, Maria Cervera, Francesco D'Angelo, Johannes von Oswald, Regina Traber, Benjamin Ehret, Seijin Kobayashi, Benjamin F. Grewe, João Sacramento
NeurIPS 2021 Repulsive Deep Ensembles Are Bayesian Francesco D'Angelo, Vincent Fortuin