Distribution Matching for Rationalization
Abstract
The task of rationalization aims to extract pieces of input text as rationales to justify neural network predictions on text classification tasks. By definition, rationales represent key text pieces used for prediction and thus should have similar classification feature distribution compared to the original input text. However, previous methods mainly focused on maximizing the mutual information between rationales and labels while neglecting the relationship between rationales and input text. To address this issue, we propose a novel rationalization method that matches the distributions of rationales and input text in both the feature space and output space. Empirically, the proposed distribution matching approach consistently outperforms previous methods by a large margin. Our data and code are available.
Cite
Text
Huang et al. "Distribution Matching for Rationalization." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I14.17547Markdown
[Huang et al. "Distribution Matching for Rationalization." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/huang2021aaai-distribution/) doi:10.1609/AAAI.V35I14.17547BibTeX
@inproceedings{huang2021aaai-distribution,
title = {{Distribution Matching for Rationalization}},
author = {Huang, Yongfeng and Chen, Yujun and Du, Yulun and Yang, Zhilin},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2021},
pages = {13090-13097},
doi = {10.1609/AAAI.V35I14.17547},
url = {https://mlanthology.org/aaai/2021/huang2021aaai-distribution/}
}