D'Angelo, Francesco

6 publications

ICLR 2025. Selective Induction Heads: How Transformers Select Causal Structures in Context. Francesco D'Angelo, Francesco Croce, Nicolas Flammarion

NeurIPS 2025. The Emergence of Sparse Attention: Impact of Data Distribution and Benefits of Repetition. Nicolas Zucchet, Francesco D'Angelo, Andrew Kyle Lampinen, Stephanie C.Y. Chan

NeurIPS 2024. Why Do We Need Weight Decay in Modern Deep Learning? Francesco D'Angelo, Maksym Andriushchenko, Aditya Varre, Nicolas Flammarion

NeurIPSW 2023. Why Do We Need Weight Decay for Overparameterized Deep Networks? Francesco D'Angelo, Aditya Varre, Maksym Andriushchenko, Nicolas Flammarion

NeurIPS 2021. Posterior Meta-Replay for Continual Learning. Christian Henning, Maria Cervera, Francesco D'Angelo, Johannes von Oswald, Regina Traber, Benjamin Ehret, Seijin Kobayashi, Benjamin F. Grewe, João Sacramento

NeurIPS 2021. Repulsive Deep Ensembles Are Bayesian. Francesco D'Angelo, Vincent Fortuin