Explainable Abusive Language Classification Leveraging User and Network Data
Abstract
Online hate speech is a phenomenon with considerable consequences for our society. Its automatic detection using machine learning is a promising approach to contain its spread. However, classifying abusive language with a model that purely relies on text data is limited in performance due to the complexity and diversity of speech (e.g., irony, sarcasm). Moreover, studies have shown that a significant amount of hate on social media platforms stems from online hate communities. Therefore, we develop an abusive language detection model leveraging user and network data to improve the classification performance. We integrate the explainable AI framework SHAP (SHapley Additive exPlanations) to alleviate the general issue of missing transparency associated with deep learning models, allowing us to assess the model’s vulnerability toward bias and systematic discrimination reliably. Furthermore, we evaluate our multimodel architecture on three datasets in two languages (i.e., English and German). Our results show that user-specific timeline and network data can improve the classification, while the additional explanations resulting from SHAP make the predictions of the model interpretable to humans.
Cite
Text
Wich et al. "Explainable Abusive Language Classification Leveraging User and Network Data." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021. doi:10.1007/978-3-030-86517-7_30Markdown
[Wich et al. "Explainable Abusive Language Classification Leveraging User and Network Data." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021.](https://mlanthology.org/ecmlpkdd/2021/wich2021ecmlpkdd-explainable/) doi:10.1007/978-3-030-86517-7_30BibTeX
@inproceedings{wich2021ecmlpkdd-explainable,
title = {{Explainable Abusive Language Classification Leveraging User and Network Data}},
author = {Wich, Maximilian and Mosca, Edoardo and Gorniak, Adrian and Hingerl, Johannes and Groh, Georg},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2021},
pages = {481-496},
doi = {10.1007/978-3-030-86517-7_30},
url = {https://mlanthology.org/ecmlpkdd/2021/wich2021ecmlpkdd-explainable/}
}