Boizard, Nicolas

3 publications

ICLR 2026. Should We Still Pretrain Encoders with Masked Language Modeling? Hippolyte Gisserot-Boukhlef, Nicolas Boizard, Manuel Faysse, Duarte Miguel Alves, Emmanuel Malherbe, Andre Martins, Celine Hudelot, Pierre Colombo
TMLR 2025. CroissantLLM: A Truly Bilingual French-English Language Model. Manuel Faysse, Patrick Fernandes, Nuno M Guerreiro, António Loison, Duarte Miguel Alves, Caio Corro, Nicolas Boizard, João Alves, Ricardo Rei, Pedro Henrique Martins, Antoni Bigata Casademunt, François Yvon, Andre Martins, Gautier Viaud, Celine Hudelot, Pierre Colombo
TMLR 2025. Towards Cross-Tokenizer Distillation: The Universal Logit Distillation Loss for LLMs. Nicolas Boizard, Kevin El Haddad, Celine Hudelot, Pierre Colombo