Wehrstedt, Luca

2 publications

ICLRW 2025. Accelerating Transformer Inference and Training with 2:4 Activation Sparsity. Daniel Haziza, Timothy Chou, Dhruv Choudhary, Jesse Cai, Luca Wehrstedt, Francisco Massa, Jiecao Yu, Geonhwa Jeong, Supriya Rao, Patrick Labatut
TMLR 2025. Efficient Hardware Scaling and Diminishing Returns in Large-Scale Training of Language Models. Jared Fernandez, Luca Wehrstedt, Leonid Shamis, Mostafa Elhoushi, Kalyan Saladi, Yonatan Bisk, Emma Strubell, Jacob Kahn