XTREME: A Massively Multilingual Multi-Task Benchmark for Evaluating Cross-Lingual Generalisation

Abstract

Much recent progress in applications of machine learning models to NLP has been driven by benchmarks that evaluate models across a wide variety of tasks. However, these broad-coverage benchmarks have been mostly limited to English, and despite an increasing interest in multilingual models, a benchmark that enables the comprehensive evaluation of such methods on a diverse range of languages and tasks is still missing. To this end, we introduce the Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME) benchmark, a multi-task benchmark for evaluating the cross-lingual generalization capabilities of multilingual representations across 40 languages and 9 tasks. We demonstrate that while models tested on English reach human performance on many tasks, there is still a sizable gap in the performance of cross-lingually transferred models, particularly on syntactic and sentence retrieval tasks. There is also a wide spread of results across languages. We will release the benchmark to encourage research on cross-lingual learning methods that transfer linguistic knowledge across a diverse and representative set of languages and tasks.

Cite

Text

Hu et al. "XTREME: A Massively Multilingual Multi-Task Benchmark for Evaluating Cross-Lingual Generalisation." International Conference on Machine Learning, 2020.

Markdown

[Hu et al. "XTREME: A Massively Multilingual Multi-Task Benchmark for Evaluating Cross-Lingual Generalisation." International Conference on Machine Learning, 2020.](https://mlanthology.org/icml/2020/hu2020icml-xtreme/)

BibTeX

@inproceedings{hu2020icml-xtreme,
  title     = {{XTREME: A Massively Multilingual Multi-Task Benchmark for Evaluating Cross-Lingual Generalisation}},
  author    = {Hu, Junjie and Ruder, Sebastian and Siddhant, Aditya and Neubig, Graham and Firat, Orhan and Johnson, Melvin},
  booktitle = {International Conference on Machine Learning},
  year      = {2020},
  pages     = {4411-4421},
  volume    = {119},
  url       = {https://mlanthology.org/icml/2020/hu2020icml-xtreme/}
}