Anand, Rathul

1 publications

ICLR 2025 Mini-Batch Coresets for Memory-Efficient Language Model Training on Data Mixtures Dang Nguyen, Wenhan Yang, Rathul Anand, Yu Yang, Baharan Mirzasoleiman