Compositional Generalization for Multi-Label Text Classification: A Data-Augmentation Approach

Abstract

Despite significant advancements in multi-label text classification, the ability of existing models to generalize to novel and seldom-encountered complex concepts, which are compositions of elementary ones, remains underexplored. This research addresses this gap. By creating unique data splits across three benchmarks, we assess the compositional generalization ability of existing multi-label text classification models. Our results show that these models often fail to generalize to compositional concepts encountered infrequently during training, leading to inferior performance on tests with these new combinations. To address this, we introduce a data augmentation method that leverages two innovative text generation models designed to enhance the classification models' capacity for compositional generalization. Our experiments show that this data augmentation approach significantly improves the compositional generalization capabilities of classification models on our benchmarks, with both generation models surpassing other text generation baselines. Our codes available at https://github.com/yychai74/LD-VAE.

Cite

Text

Chai et al. "Compositional Generalization for Multi-Label Text Classification: A Data-Augmentation Approach." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I16.29725

Markdown

[Chai et al. "Compositional Generalization for Multi-Label Text Classification: A Data-Augmentation Approach." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/chai2024aaai-compositional/) doi:10.1609/AAAI.V38I16.29725

BibTeX

@inproceedings{chai2024aaai-compositional,
  title     = {{Compositional Generalization for Multi-Label Text Classification: A Data-Augmentation Approach}},
  author    = {Chai, Yuyang and Li, Zhuang and Liu, Jiahui and Chen, Lei and Li, Fei and Ji, Donghong and Teng, Chong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {17727-17735},
  doi       = {10.1609/AAAI.V38I16.29725},
  url       = {https://mlanthology.org/aaai/2024/chai2024aaai-compositional/}
}