OUCopula: Bi-Channel Multi-Label Copula-Enhanced Adapter-Based CNN for Myopia Screening Based on OU-UWF Images

Li, Yang; Huang, Qiuyi; Zhong, Chong; Yang, Danjuan; Li, Meiyan; Welsh, Alan H.; Liu, Aiyi; Fu, Bo; Liu, Catherine C.; Zhou, Xingtao

doi:10.24963/ijcai.2024/655

OUCopula: Bi-Channel Multi-Label Copula-Enhanced Adapter-Based CNN for Myopia Screening Based on OU-UWF Images

Yang Li, Qiuyi Huang, Chong Zhong, Danjuan Yang, Meiyan Li, Alan H. Welsh, Aiyi Liu, Bo Fu, Catherine C. Liu, Xingtao Zhou

IJCAI 2024 pp. 5927-5935

doi:10.24963/ijcai.2024/655 /ijcai/2024/li2024ijcai-oucopula/

Abstract

Vision-language models (VLMs) have exhibited remarkable generalization capabilities, and prompt learning for VLMs has attracted great attention for the ability to adapt pre-trained VLMs to specific downstream tasks. However, existing studies mainly focus on single-modal prompts or uni-directional modality interaction, overlooking the powerful alignment effects resulting from the interaction between the vision and language modalities. To this end, we propose a novel prompt learning method called Bi-directional Modality Interaction Prompt (BMIP), which dynamically weights bi-modal information through learning the information of the attention layer, enhancing trainability and inter-modal consistency compared to simple information aggregation methods. To evaluate the effectiveness of prompt learning methods, we propose a more realistic evaluation paradigm called open-world generalization complementing the widely adopted cross-dataset transfer and domain generalization tasks. Comprehensive experiments on various datasets reveal that BMIP not only outperforms current state-of-the-art methods across all three evaluation paradigms but is also flexible enough to be combined with other prompt-based methods for consistent performance enhancement.

PDF IJCAI Semantic Scholar

Cite

Text

Li et al. "OUCopula: Bi-Channel Multi-Label Copula-Enhanced Adapter-Based CNN for Myopia Screening Based on OU-UWF Images." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/655

Markdown

[Li et al. "OUCopula: Bi-Channel Multi-Label Copula-Enhanced Adapter-Based CNN for Myopia Screening Based on OU-UWF Images." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/li2024ijcai-oucopula/) doi:10.24963/ijcai.2024/655

BibTeX

@inproceedings{li2024ijcai-oucopula,
  title     = {{OUCopula: Bi-Channel Multi-Label Copula-Enhanced Adapter-Based CNN for Myopia Screening Based on OU-UWF Images}},
  author    = {Li, Yang and Huang, Qiuyi and Zhong, Chong and Yang, Danjuan and Li, Meiyan and Welsh, Alan H. and Liu, Aiyi and Fu, Bo and Liu, Catherine C. and Zhou, Xingtao},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {5927-5935},
  doi       = {10.24963/ijcai.2024/655},
  url       = {https://mlanthology.org/ijcai/2024/li2024ijcai-oucopula/}
}