Cherry, Colin

6 publications

ICLRW 2025 Don't Throw Away Data: Improving Sequence Knowledge Distillation with Minimum Bayes Risk Decoding Jun Wang, Eleftheria Briakou, Hamid Dadkhahi, Rishabh Agarwal, Colin Cherry, Trevor Cohn
ICML 2025 Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation Muhammed Yusuf Kocyigit, Eleftheria Briakou, Daniel Deutsch, Jiaming Luo, Colin Cherry, Markus Freitag
ICLR 2024 When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method Biao Zhang, Zhongtao Liu, Colin Cherry, Orhan Firat
ICML 2023 The Unreasonable Effectiveness of Few-Shot Learning for Machine Translation Xavier Garcia, Yamini Bansal, Colin Cherry, George Foster, Maxim Krikun, Melvin Johnson, Orhan Firat
ICML 2022 Data Scaling Laws in NMT: The Effect of Noise and Architecture Yamini Bansal, Behrooz Ghorbani, Ankush Garg, Biao Zhang, Colin Cherry, Behnam Neyshabur, Orhan Firat
ICLR 2022 Scaling Laws for Neural Machine Translation Behrooz Ghorbani, Orhan Firat, Markus Freitag, Ankur Bapna, Maxim Krikun, Xavier Garcia, Ciprian Chelba, Colin Cherry