Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training
Abstract
The goal of stance detection is to determine the viewpoint expressed in a piece of text towards a target. These viewpoints or contexts are often expressed in many different languages depending on the user and the platform, which can be a local news outlet, a social media platform, a news forum, etc. Most research on stance detection, however, has been limited to working with a single language and on a few limited targets, with little work on cross-lingual stance detection. Moreover, non-English sources of labelled data are often scarce and present additional challenges. Recently, large multilingual language models have substantially improved the performance on many non-English tasks, especially such with a limited number of examples. This highlights the importance of model pre-training and its ability to learn from few examples. In this paper, we present the most comprehensive study of cross-lingual stance detection to date: we experiment with 15 diverse datasets in 12 languages from 6 language families, and with 6 low-resource evaluation settings each. For our experiments, we build on pattern-exploiting training (PET), proposing the addition of a novel label encoder to simplify the verbalisation procedure. We further propose sentiment-based generation of stance data for pre-training, which shows sizeable improvement of more than 6% F1 absolute in few-shot learning settings compared to several strong baselines.
Cite
Text
Hardalov et al. "Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I10.21318Markdown
[Hardalov et al. "Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/hardalov2022aaai-few/) doi:10.1609/AAAI.V36I10.21318BibTeX
@inproceedings{hardalov2022aaai-few,
title = {{Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training}},
author = {Hardalov, Momchil and Arora, Arnav and Nakov, Preslav and Augenstein, Isabelle},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2022},
pages = {10729-10737},
doi = {10.1609/AAAI.V36I10.21318},
url = {https://mlanthology.org/aaai/2022/hardalov2022aaai-few/}
}