Zera-Shot Sentiment Analysis for Code-Mixed Data

Abstract

Code-mixing is the practice of alternating between two or more languages. A major part of sentiment analysis research has been monolingual and they perform poorly on the code-mixed text. We introduce methods that use multilingual and cross-lingual embeddings to transfer knowledge from monolingual text to code-mixed text for code-mixed sentiment analysis. Our methods handle code-mixed text through zero-shot learning and beat state-of-the-art English-Spanish code-mixed sentiment analysis by an absolute 3% F1-score. We are able to achieve 0.58 F1-score (without a parallel corpus) and 0.62 F1-score (with the parallel corpus) on the same benchmark in a zero-shot way as compared to 0.68 F1-score in supervised settings. Our code is publicly available on github.com/sedflix/unsacmt.

Cite

Text

Yadav and Chakraborty. "Zera-Shot Sentiment Analysis for Code-Mixed Data." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I18.17967

Markdown

[Yadav and Chakraborty. "Zera-Shot Sentiment Analysis for Code-Mixed Data." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/yadav2021aaai-zera/) doi:10.1609/AAAI.V35I18.17967

BibTeX

@inproceedings{yadav2021aaai-zera,
  title     = {{Zera-Shot Sentiment Analysis for Code-Mixed Data}},
  author    = {Yadav, Siddharth and Chakraborty, Tanmoy},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {15941-15942},
  doi       = {10.1609/AAAI.V35I18.17967},
  url       = {https://mlanthology.org/aaai/2021/yadav2021aaai-zera/}
}