ML Anthology
Authors
Search
About
Pagliardini, Matteo
16 publications
TMLR
2026
Leveraging the True Depth of LLMs
Ramón Calvo González
,
Daniele Paliotta
,
Matteo Pagliardini
,
Martin Jaggi
,
François Fleuret
ICLR
2025
CoTFormer: A Chain of Thought Driven Architecture with Budget-Adaptive Computation Cost at Inference
Amirkeivan Mohtashami
,
Matteo Pagliardini
,
Martin Jaggi
ICLRW
2025
Leveraging the True Depth of LLMs
Ramón Calvo González
,
Daniele Paliotta
,
Matteo Pagliardini
,
Martin Jaggi
,
François Fleuret
ICLR
2025
The AdEMAMix Optimizer: Better, Faster, Older
Matteo Pagliardini
,
Pierre Ablin
,
David Grangier
ICLRW
2025
Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners
Daniele Paliotta
,
Junxiong Wang
,
Matteo Pagliardini
,
Kevin Li
,
Aviv Bick
,
Albert Gu
,
François Fleuret
,
Tri Dao
ICLR
2024
A Primal-Dual Approach to Solving Variational Inequalities with General Constraints
Tatjana Chavdarova
,
Tong Yang
,
Matteo Pagliardini
,
Michael Jordan
NeurIPSW
2024
AdEMAMix: Better and Faster Training with Older Gradients
Matteo Pagliardini
,
Pierre Ablin
,
David Grangier
ICML
2024
DOGE: Domain Reweighting with Generalization Estimation
Simin Fan
,
Matteo Pagliardini
,
Martin Jaggi
NeurIPS
2024
DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging
Matteo Pagliardini
,
Amirkeivan Mohtashami
,
Francois Fleuret
,
Martin Jaggi
ICLR
2023
Agree to Disagree: Diversity Through Disagreement for Better Transferability
Matteo Pagliardini
,
Martin Jaggi
,
François Fleuret
,
Sai Praneeth Karimireddy
NeurIPSW
2023
CoTFormer: More Tokens with Attention Make up for Less Depth
Amirkeivan Mohtashami
,
Matteo Pagliardini
,
Martin Jaggi
NeurIPSW
2023
DOGE: Domain Reweighting with Generalization Estimation
Simin Fan
,
Matteo Pagliardini
,
Martin Jaggi
NeurIPS
2023
Fast Attention over Long Sequences with Dynamic Sparse Flash Attention
Matteo Pagliardini
,
Daniele Paliotta
,
Martin Jaggi
,
François Fleuret
ICMLW
2023
Fast Causal Attention with Dynamic Sparsity
Daniele Paliotta
,
Matteo Pagliardini
,
Martin Jaggi
,
François Fleuret
NeurIPSW
2022
Diversity Through Disagreement for Better Transferability
Matteo Pagliardini
,
Martin Jaggi
,
François Fleuret
,
Sai Praneeth Karimireddy
ICLR
2021
Taming GANs with Lookahead-Minmax
Tatjana Chavdarova
,
Matteo Pagliardini
,
Sebastian U Stich
,
François Fleuret
,
Martin Jaggi