Discovering User Attribute Stylistic Differences via Paraphrasing

Abstract

User attribute prediction from social media text has proven successful and useful for downstream tasks. In previous studies, differences in user trait language use have been limited primarily to the presence or absence of words that indicate topical preferences. In this study, we aim to find linguistic style distinctions across three different user attributes: gender, age and occupational class. By combining paraphrases with a simple yet effective method, we capture a wide set of stylistic differences that are exempt from topic bias. We show their predictive power in user profiling, conformity with human perception and psycholinguistic hypotheses, and potential use in generating natural language tailored to specific user traits.

Cite

Text

Preotiuc-Pietro et al. "Discovering User Attribute Stylistic Differences via Paraphrasing." AAAI Conference on Artificial Intelligence, 2016. doi:10.1609/AAAI.V30I1.10393

Markdown

[Preotiuc-Pietro et al. "Discovering User Attribute Stylistic Differences via Paraphrasing." AAAI Conference on Artificial Intelligence, 2016.](https://mlanthology.org/aaai/2016/preotiucpietro2016aaai-discovering/) doi:10.1609/AAAI.V30I1.10393

BibTeX

@inproceedings{preotiucpietro2016aaai-discovering,
  title     = {{Discovering User Attribute Stylistic Differences via Paraphrasing}},
  author    = {Preotiuc-Pietro, Daniel and Xu, Wei and Ungar, Lyle H.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2016},
  pages     = {3030-3037},
  doi       = {10.1609/AAAI.V30I1.10393},
  url       = {https://mlanthology.org/aaai/2016/preotiucpietro2016aaai-discovering/}
}