Saladi, Kalyan

1 publication

TMLR 2025. Efficient Hardware Scaling and Diminishing Returns in Large-Scale Training of Language Models. Jared Fernandez, Luca Wehrstedt, Leonid Shamis, Mostafa Elhoushi, Kalyan Saladi, Yonatan Bisk, Emma Strubell, Jacob Kahn.