ML Anthology
Authors
Search
About
Blake, Charlie
8 publications
ICLR
2025
U-$\mu$P: The Unit-Scaled Maximal Update Parametrization
Charlie Blake
,
Constantin Eichenberg
,
Josef Dean
,
Lukas Balles
,
Luke Yuri Prince
,
Björn Deiseroth
,
Andres Felipe Cruz-Salinas
,
Carlo Luschi
,
Samuel Weinbach
,
Douglas Orr
ICML
2024
SparQ Attention: Bandwidth-Efficient LLM Inference
Luka Ribar
,
Ivan Chelombiev
,
Luke Hudlass-Galley
,
Charlie Blake
,
Carlo Luschi
,
Douglas Orr
ICLRW
2024
SparQ Attention: Bandwidth-Efficient LLM Inference
Luka Ribar
,
Ivan Chelombiev
,
Luke Hudlass-Galley
,
Charlie Blake
,
Carlo Luschi
,
Douglas Orr
NeurIPSW
2024
U-$\mu$P: The Unit-Scaled Maximal Update Parametrization
Charlie Blake
,
Constantin Eichenberg
,
Josef Dean
,
Lukas Balles
,
Luke Yuri Prince
,
Björn Deiseroth
,
Andres Felipe Cruz-Salinas
,
Carlo Luschi
,
Samuel Weinbach
,
Douglas Orr
ICMLW
2024
U-μP: The Unit-Scaled Maximal Update Parametrization
Charlie Blake
,
Constantin Eichenberg
,
Josef Dean
,
Lukas Balles
,
Luke Yuri Prince
,
Björn Deiseroth
,
Andres Felipe Cruz-Salinas
,
Carlo Luschi
,
Samuel Weinbach
,
Douglas Orr
ICMLW
2024
U-μP: The Unit-Scaled Maximal Update Parametrization
Charlie Blake
,
Constantin Eichenberg
,
Josef Dean
,
Lukas Balles
,
Luke Yuri Prince
,
Björn Deiseroth
,
Andres Felipe Cruz-Salinas
,
Carlo Luschi
,
Samuel Weinbach
,
Douglas Orr
NeurIPSW
2023
Training and Inference of Large Language Models Using 8-Bit Floating Point
Sergio P. Perez
,
Yan Zhang
,
James Briggs
,
Charlie Blake
,
Josh Levy-Kramer
,
Paul Balanca
,
Carlo Luschi
,
Stephen Barlow
,
Andrew W Fitzgibbon
ICML
2023
Unit Scaling: Out-of-the-Box Low-Precision Training
Charlie Blake
,
Douglas Orr
,
Carlo Luschi