Counterfactual User Sequence Synthesis Augmented with Continuous Time Dynamic Preference Modeling for Sequential POI Recommendation
Abstract
Video understanding seeks to enable machines to interpret visual content across three levels: action, event, and story. Existing models are limited in their ability to perform high-level long-term story understanding, due to (1) the oversimplified treatment of temporal information and (2) the training bias introduced by action/event-centric datasets. To address this, we introduce SCVBench, a novel benchmark for story-centric video understanding. SCVBench evaluates LVLMs through an event ordering task decomposed into sub-questions leading to a final question, quantitatively measuring historical dialogue exploration. We collected 1,253 final questions and 6,027 sub-question pairs from 925 videos, constructing continuous multi-turn dialogues. Experimental results show that while closed-source GPT-4o outperforms other models, most open-source LVLMs struggle with story-centric video understanding. Additionally, our StoryCoT model significantly surpasses open-source LVLMs on SCVBench. SCVBench aims to advance research by comprehensively analyzing LVLMs' temporal reasoning and comprehension capabilities. Code can be accessed at https://github.com/yuanrr/SCVBench.
Cite
Text
Qi et al. "Counterfactual User Sequence Synthesis Augmented with Continuous Time Dynamic Preference Modeling for Sequential POI Recommendation." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/255Markdown
[Qi et al. "Counterfactual User Sequence Synthesis Augmented with Continuous Time Dynamic Preference Modeling for Sequential POI Recommendation." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/qi2024ijcai-counterfactual/) doi:10.24963/ijcai.2024/255BibTeX
@inproceedings{qi2024ijcai-counterfactual,
title = {{Counterfactual User Sequence Synthesis Augmented with Continuous Time Dynamic Preference Modeling for Sequential POI Recommendation}},
author = {Qi, Lianyong and Liu, Yuwen and Liu, Weiming and Pei, Shichao and Xu, Xiaolong and Zhang, Xuyun and Wang, Yingjie and Dou, Wanchun},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {2306-2314},
doi = {10.24963/ijcai.2024/255},
url = {https://mlanthology.org/ijcai/2024/qi2024ijcai-counterfactual/}
}