Enhancing Length Generalization for Attention Based Knowledge Tracing Models with Linear Biases

Abstract

The goal of the image change captioning task is to capture the differences between two similar images and describe them in natural language. In this paper, we decompose this task into two sub-problems, i.e., fine-grained change feature learning and discrimination of changed regions. Compared with existing methods which only focus on change feature learning, we propose a novel change captioning learning paradigm, Find and Perceive (F&P). Our proposed F&P consists of two main ideas, i.e., the Fine-Grained Semantic Change Perception (FGSCP) module for improving the model's perception ability of subtle changes and the Weakly-Supervised Discriminator (WSD) of changed regions for improving the model's sensitivity of localising the important regions. Specifically, the FGSCP deploys a two-step manner, firstly introducing the fine-grained categorisation and then enhancing the interaction of the two paired images. And the WSD adopts the contributions of each image region for final generated captions, accurately indicating which regions are important for change captions without any extra annotations. Finally, we conduct extensive experiments on four change captioning datasets, and experimental results show that our proposed method F&P outperforms existing change caption methods and achieves new state-of-the-art performance.

Cite

Text

Li et al. "Enhancing Length Generalization for Attention Based Knowledge Tracing Models with Linear Biases." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/654

Markdown

[Li et al. "Enhancing Length Generalization for Attention Based Knowledge Tracing Models with Linear Biases." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/li2024ijcai-enhancing/) doi:10.24963/ijcai.2024/654

BibTeX

@inproceedings{li2024ijcai-enhancing,
  title     = {{Enhancing Length Generalization for Attention Based Knowledge Tracing Models with Linear Biases}},
  author    = {Li, Xueyi and Bai, Youheng and Guo, Teng and Liu, Zitao and Huang, Yaying and Zhao, Xiangyu and Xia, Feng and Luo, Weiqi and Weng, Jian},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {5918-5926},
  doi       = {10.24963/ijcai.2024/654},
  url       = {https://mlanthology.org/ijcai/2024/li2024ijcai-enhancing/}
}