ArSen-20: A New Benchmark for Arabic Sentiment Detection

Abstract

Sentiment detection remains a pivotal task in natural language processing, yet its development in Arabic lags due to a scarcity of training materials compared to English. Addressing this gap, we present ArSen-20, a benchmark dataset tailored to propel Arabic sentiment detection forward. ArSen-20 comprises 20,000 professionally labeled tweets sourced from Twitter, focusing on the theme of COVID-19 and spanning the period from 2020 to 2023. Beyond tweet content, the dataset incorporates metadata associated with the user, enriching the contextual understanding. ArSen-20 offers a comprehensive resource to foster advancements in Arabic sentiment analysis and facilitate research in this critical domain.

Cite

Text

Fang and Xu. "ArSen-20: A New Benchmark for Arabic Sentiment Detection." ICLR 2024 Workshops: AfricaNLP, 2024.

Markdown

[Fang and Xu. "ArSen-20: A New Benchmark for Arabic Sentiment Detection." ICLR 2024 Workshops: AfricaNLP, 2024.](https://mlanthology.org/iclrw/2024/fang2024iclrw-arsen20/)

BibTeX

@inproceedings{fang2024iclrw-arsen20,
  title     = {{ArSen-20: A New Benchmark for Arabic Sentiment Detection}},
  author    = {Fang, Yang and Xu, Cheng},
  booktitle = {ICLR 2024 Workshops: AfricaNLP},
  year      = {2024},
  url       = {https://mlanthology.org/iclrw/2024/fang2024iclrw-arsen20/}
}