Cojocaru, Ruxandra

1 publication

NeurIPS 2023. The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data Only. Guilherme Penedo, Quentin Malartic, Daniel Hesslow, Ruxandra Cojocaru, Hamza Alobeidli, Alessandro Cappelli, Baptiste Pannier, Ebtesam Almazrouei, Julien Launay