Shamis, Leonid

2 publications

TMLR 2025 Efficient Hardware Scaling and Diminishing Returns in Large-Scale Training of Language Models Jared Fernandez, Luca Wehrstedt, Leonid Shamis, Mostafa Elhoushi, Kalyan Saladi, Yonatan Bisk, Emma Strubell, Jacob Kahn
ICLR 2025 Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Chunting Zhou, Lili Yu, Arun Babu, Kushal Tirumala, Michihiro Yasunaga, Leonid Shamis, Jacob Kahn, Xuezhe Ma, Luke Zettlemoyer, Omer Levy