Multi-Modal Sarcasm Detection Based on Dual Generative Processes
Abstract
Class-agnostic counting enables enumerating arbitrary object classes beyond those seen during training. Recent studies have attempted to exploit the potential of visual foundation models such as GroundingDINO. Despite considerable progress, we observe certain shortcomings, including the limited diversity of visual prompts and a suboptimal training regimen. To address these issues, we introduce VQCounter, which incorporates a visual prompt queue mechanism designed to enrich the diversity of visual prompts. A random modality switching strategy is applied during training to strengthen both the textual and visual modalities. In addition, in light of the weak point supervision, a Voronoi diagram-based cost (VoronoiCost) is designed to improve Hungarian matching, leading to more stable and faster convergence. Building upon the Voronoi diagram, we also propose a novel set of more stringent evaluation metrics that take point localization into account. Extensive experiments on the FSC-147 and CARPK datasets demonstrate that VQCounter achieves state-of-the-art performance in both zero-shot and few-shot settings, significantly outperforming existing methods across nearly all evaluations.
Cite
Text
Ma et al. "Multi-Modal Sarcasm Detection Based on Dual Generative Processes." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/252
Markdown
[Ma et al. "Multi-Modal Sarcasm Detection Based on Dual Generative Processes." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/ma2024ijcai-multi/) doi:10.24963/ijcai.2024/252
BibTeX
@inproceedings{ma2024ijcai-multi,
title = {{Multi-Modal Sarcasm Detection Based on Dual Generative Processes}},
author = {Ma, Huiying and He, Dongxiao and Wang, Xiaobao and Jin, Di and Ge, Meng and Wang, Longbiao},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {2279--2287},
doi = {10.24963/ijcai.2024/252},
url = {https://mlanthology.org/ijcai/2024/ma2024ijcai-multi/}
}