Dery, Lucio M.

7 publications

NeurIPS 2025 Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo Zachary Charles, Gabriel Teston, Lucio M. Dery, J Keith Rush, Nova Fallen, Zachary Garrett, Arthur Szlam, Arthur Douillard
TMLR 2024 Multitask Learning Can Improve Worst-Group Outcomes Atharva Kulkarni, Lucio M. Dery, Amrith Setlur, Aditi Raghunathan, Ameet Talwalkar, Graham Neubig
ICLR 2023 AANG : Automating Auxiliary Learning Lucio M. Dery, Paul Michel, Mikhail Khodak, Graham Neubig, Ameet Talwalkar
ICML 2023 Cross-Modal Fine-Tuning: Align Then Refine Junhong Shen, Liam Li, Lucio M. Dery, Corey Staten, Mikhail Khodak, Graham Neubig, Ameet Talwalkar
NeurIPSW 2022 Multi-Step Planning for Automated Hyperparameter Optimization with OptFormer Lucio M. Dery, Abram L. Friesen, Nando de Freitas, MarcAurelio Ranzato, Yutian Chen
ICLR 2022 Should We Be Pre-Training? an Argument for End-Task Aware Training as an Alternative Lucio M. Dery, Paul Michel, Ameet Talwalkar, Graham Neubig
ICLR 2021 Auxiliary Task Update Decomposition: The Good, the Bad and the Neutral Lucio M. Dery, Yann Dauphin, David Grangier