Explainable Abusive Language Classification Leveraging User and Network Data

Wich, Maximilian; Mosca, Edoardo; Gorniak, Adrian; Hingerl, Johannes; Groh, Georg

doi:10.1007/978-3-030-86517-7_30

Explainable Abusive Language Classification Leveraging User and Network Data

Maximilian Wich, Edoardo Mosca, Adrian Gorniak, Johannes Hingerl, Georg Groh

ECML-PKDD 2021 pp. 481-496

doi:10.1007/978-3-030-86517-7_30 /ecmlpkdd/2021/wich2021ecmlpkdd-explainable/

Abstract

Online hate speech is a phenomenon with considerable consequences for our society. Its automatic detection using machine learning is a promising approach to contain its spread. However, classifying abusive language with a model that purely relies on text data is limited in performance due to the complexity and diversity of speech (e.g., irony, sarcasm). Moreover, studies have shown that a significant amount of hate on social media platforms stems from online hate communities. Therefore, we develop an abusive language detection model leveraging user and network data to improve the classification performance. We integrate the explainable AI framework SHAP (SHapley Additive exPlanations) to alleviate the general issue of missing transparency associated with deep learning models, allowing us to assess the model’s vulnerability toward bias and systematic discrimination reliably. Furthermore, we evaluate our multimodel architecture on three datasets in two languages (i.e., English and German). Our results show that user-specific timeline and network data can improve the classification, while the additional explanations resulting from SHAP make the predictions of the model interpretable to humans.

PDF ECML-PKDD Semantic Scholar

Cite

Text

Wich et al. "Explainable Abusive Language Classification Leveraging User and Network Data." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021. doi:10.1007/978-3-030-86517-7_30

Markdown

[Wich et al. "Explainable Abusive Language Classification Leveraging User and Network Data." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021.](https://mlanthology.org/ecmlpkdd/2021/wich2021ecmlpkdd-explainable/) doi:10.1007/978-3-030-86517-7_30

BibTeX

@inproceedings{wich2021ecmlpkdd-explainable,
  title     = {{Explainable Abusive Language Classification Leveraging User and Network Data}},
  author    = {Wich, Maximilian and Mosca, Edoardo and Gorniak, Adrian and Hingerl, Johannes and Groh, Georg},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2021},
  pages     = {481-496},
  doi       = {10.1007/978-3-030-86517-7_30},
  url       = {https://mlanthology.org/ecmlpkdd/2021/wich2021ecmlpkdd-explainable/}
}