Explainable Text Classification via Attentive and Targeted Mixing Data Augmentation

Abstract

Mixing data augmentation methods have been widely used in text classification recently. However, existing methods do not control the quality of augmented data and have low model explainability. To tackle these issues, this paper proposes an explainable text classification solution based on attentive and targeted mixing data augmentation, ATMIX. Instead of selecting data for augmentation without control, ATMIX focuses on the misclassified training samples as the target for augmentation to better improve the model's capability. Meanwhile, to generate meaningful augmented samples, it adopts a self-attention mechanism to understand the importance of the subsentences in a text, and cut and mix the subsentences between the misclassified and correctly classified samples wisely. Furthermore, it employs a novel dynamic augmented data selection framework based on the loss function gradient to dynamically optimize the augmented samples for model training. In the end, we develop a new model explainability evaluation method based on subsentence attention and conduct extensive evaluations over multiple real-world text datasets. The results indicate that ATMIX is more effective with higher explainability than the typical classification models, hidden-level, and input-level mixup models.

Cite

Text

Jiang et al. "Explainable Text Classification via Attentive and Targeted Mixing Data Augmentation." International Joint Conference on Artificial Intelligence, 2023. doi:10.24963/IJCAI.2023/565

Markdown

[Jiang et al. "Explainable Text Classification via Attentive and Targeted Mixing Data Augmentation." International Joint Conference on Artificial Intelligence, 2023.](https://mlanthology.org/ijcai/2023/jiang2023ijcai-explainable/) doi:10.24963/IJCAI.2023/565

BibTeX

@inproceedings{jiang2023ijcai-explainable,
  title     = {{Explainable Text Classification via Attentive and Targeted Mixing Data Augmentation}},
  author    = {Jiang, Songhao and Chu, Yan and Wang, Zhengkui and Ma, Tianxing and Wang, Hanlin and Lu, Wenxuan and Zang, Tianning and Wang, Bo},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {5085-5094},
  doi       = {10.24963/IJCAI.2023/565},
  url       = {https://mlanthology.org/ijcai/2023/jiang2023ijcai-explainable/}
}