Probabilistic Masked Attention Networks for Explainable Sequential Recommendation

Abstract

Transformer-based models are powerful for modeling temporal dynamics of user preference in sequential recommendation. Most of the variants adopt the Softmax transformation in the self-attention layers to generate dense attention probabilities. However, real-world item sequences are often noisy, containing a mixture of true-positive and false-positive interactions. Such dense attentions inevitably assign probability mass to noisy or irrelevant items, leading to sub-optimal performance and poor explainability. Here we propose a Probabilistic Masked Attention Network (PMAN) to identify the sparse pattern of attentions, which is more desirable for pruning noisy items in sequential recommendation. Specifically, we employ a probabilistic mask to achieve sparse attentions under a constrained optimization framework. As such, PMAN allows to select which information is critical to be retained or dropped in a data-driven fashion. Experimental studies on real-world benchmark datasets show that PMAN is able to improve the performance of Transformers significantly.

Cite

Text

Chen et al. "Probabilistic Masked Attention Networks for Explainable Sequential Recommendation." International Joint Conference on Artificial Intelligence, 2023. doi:10.24963/IJCAI.2023/230

Markdown

[Chen et al. "Probabilistic Masked Attention Networks for Explainable Sequential Recommendation." International Joint Conference on Artificial Intelligence, 2023.](https://mlanthology.org/ijcai/2023/chen2023ijcai-probabilistic/) doi:10.24963/IJCAI.2023/230

BibTeX

@inproceedings{chen2023ijcai-probabilistic,
  title     = {{Probabilistic Masked Attention Networks for Explainable Sequential Recommendation}},
  author    = {Chen, Huiyuan and Zhou, Kaixiong and Jiang, Zhimeng and Yeh, Chin-Chia Michael and Li, Xiaoting and Pan, Menghai and Zheng, Yan and Hu, Xia and Yang, Hao},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {2068-2076},
  doi       = {10.24963/IJCAI.2023/230},
  url       = {https://mlanthology.org/ijcai/2023/chen2023ijcai-probabilistic/}
}