Pagliardini, Matteo

16 publications

TMLR 2026 Leveraging the True Depth of LLMs Ramón Calvo González, Daniele Paliotta, Matteo Pagliardini, Martin Jaggi, François Fleuret
ICLR 2025 CoTFormer: A Chain of Thought Driven Architecture with Budget-Adaptive Computation Cost at Inference Amirkeivan Mohtashami, Matteo Pagliardini, Martin Jaggi
ICLRW 2025 Leveraging the True Depth of LLMs Ramón Calvo González, Daniele Paliotta, Matteo Pagliardini, Martin Jaggi, François Fleuret
ICLR 2025 The AdEMAMix Optimizer: Better, Faster, Older Matteo Pagliardini, Pierre Ablin, David Grangier
ICLRW 2025 Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners Daniele Paliotta, Junxiong Wang, Matteo Pagliardini, Kevin Li, Aviv Bick, Albert Gu, François Fleuret, Tri Dao
ICLR 2024 A Primal-Dual Approach to Solving Variational Inequalities with General Constraints Tatjana Chavdarova, Tong Yang, Matteo Pagliardini, Michael Jordan
NeurIPSW 2024 AdEMAMix: Better and Faster Training with Older Gradients Matteo Pagliardini, Pierre Ablin, David Grangier
ICML 2024 DOGE: Domain Reweighting with Generalization Estimation Simin Fan, Matteo Pagliardini, Martin Jaggi
NeurIPS 2024 DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging Matteo Pagliardini, Amirkeivan Mohtashami, Francois Fleuret, Martin Jaggi
ICLR 2023 Agree to Disagree: Diversity Through Disagreement for Better Transferability Matteo Pagliardini, Martin Jaggi, François Fleuret, Sai Praneeth Karimireddy
NeurIPSW 2023 CoTFormer: More Tokens with Attention Make up for Less Depth Amirkeivan Mohtashami, Matteo Pagliardini, Martin Jaggi
NeurIPSW 2023 DOGE: Domain Reweighting with Generalization Estimation Simin Fan, Matteo Pagliardini, Martin Jaggi
NeurIPS 2023 Fast Attention over Long Sequences with Dynamic Sparse Flash Attention Matteo Pagliardini, Daniele Paliotta, Martin Jaggi, François Fleuret
ICMLW 2023 Fast Causal Attention with Dynamic Sparsity Daniele Paliotta, Matteo Pagliardini, Martin Jaggi, François Fleuret
NeurIPSW 2022 Diversity Through Disagreement for Better Transferability Matteo Pagliardini, Martin Jaggi, François Fleuret, Sai Praneeth Karimireddy
ICLR 2021 Taming GANs with Lookahead-Minmax Tatjana Chavdarova, Matteo Pagliardini, Sebastian U Stich, François Fleuret, Martin Jaggi