Natural Language Understanding for African Languages

Abstract

Natural Language Understanding(NLU) is a fundamental building block of goal-oriented conversational AI. In NLU, the two key tasks are predicting the intent of the user’s query and the corresponding slots. Most NLU resources available are for high-resource languages like English. In this paper, we address the limited availability of NLU resources for African languages, most of which are considered Low Resource Languages(LRLs), by presenting the first extension of one the most widely used NLU dataset, the Airline Travel Information Systems (ATIS) dataset to Swahili, Kinyarwanda. We perform baseline experiments using BERT,mBERT, RoBERTa, XLM-RoBERTa under zero-shot settings and achieve promising results. We release the datasets and the annotation tool used for the utterance slot labeling to the community to further NLU research on NLU for African Languages.

Cite

Text

Mastel et al. "Natural Language Understanding for African Languages." ICLR 2023 Workshops: AfricaNLP, 2023.

Markdown

[Mastel et al. "Natural Language Understanding for African Languages." ICLR 2023 Workshops: AfricaNLP, 2023.](https://mlanthology.org/iclrw/2023/mastel2023iclrw-natural/)

BibTeX

@inproceedings{mastel2023iclrw-natural,
  title     = {{Natural Language Understanding for African Languages}},
  author    = {Mastel, Pierrette MAHORO and Mastel, Pierrette MAHORO and Namara, Ester and Munezero, Aime and Kagame, Richard and Wang, Zihan and Anzagira, Allan and Gupta, Akshat and Ndibwile, Jema David},
  booktitle = {ICLR 2023 Workshops: AfricaNLP},
  year      = {2023},
  url       = {https://mlanthology.org/iclrw/2023/mastel2023iclrw-natural/}
}