Tirumala, Kushal

11 publications

NeurIPS 2025 CAT: Content-Adaptive Image Tokenization Junhong Shen, Kushal Tirumala, Michihiro Yasunaga, Ishan Misra, Luke Zettlemoyer, Lili Yu, Chunting Zhou
DMLR 2025 Text Quality-Based Pruning for Efficient Training of Language Models Vasu Sharma, Karthik Padthe, Newsha Ardalani, Kushal Tirumala, Russell Howes, Hu Xu, Po-Yao Huang, Daniel Li Chen, Armen Aghajanyan, Gargi Ghosh, Luke Zettlemoyer
ICLR 2025 The Unreasonable Ineffectiveness of the Deeper Layers Andrey Gromov, Kushal Tirumala, Hassan Shapourian, Paolo Glorioso, Dan Roberts
ICLR 2025 Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Chunting Zhou, Lili Yu, Arun Babu, Kushal Tirumala, Michihiro Yasunaga, Leonid Shamis, Jacob Kahn, Xuezhe Ma, Luke Zettlemoyer, Omer Levy
NeurIPS 2025 When Worse Is Better: Navigating the Compression Generation Trade-Off in Visual Tokenization Vivek Ramanujan, Kushal Tirumala, Armen Aghajanyan, Luke Zettlemoyer, Ali Farhadi
ICLR 2024 Effective Pruning of Web-Scale Datasets Based on Complexity of Concept Clusters Amro Kamal Mohamed Abbas, Evgenia Rusak, Kushal Tirumala, Wieland Brendel, Kamalika Chaudhuri, Ari S. Morcos
NeurIPSW 2024 The Unreasonable Ineffectiveness of the Deeper Layers Andrey Gromov, Kushal Tirumala, Hassan Shapourian, Paolo Glorioso, Dan Roberts
NeurIPS 2023 D4: Improving LLM Pretraining via Document De-Duplication and Diversification Kushal Tirumala, Daniel Simig, Armen Aghajanyan, Ari Morcos
ICLRW 2023 SemDeDup: Data-Efficient Learning at Web-Scale Through Semantic Deduplication Amro Kamal Mohamed Abbas, Kushal Tirumala, Daniel Simig, Surya Ganguli, Ari S. Morcos
ICML 2022 Investigating Generalization by Controlling Normalized Margin Alexander R Farhang, Jeremy D Bernstein, Kushal Tirumala, Yang Liu, Yisong Yue
NeurIPS 2022 Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models Kushal Tirumala, Aram Markosyan, Luke Zettlemoyer, Armen Aghajanyan