BESTMVQA: A Benchmark Evaluation System for Medical Visual Question Answering

Abstract

Medical Visual Question Answering (Med-VQA) is a task that answers a natural language question with a medical image. Existing VQA techniques can be directly applied to solving the task. However, they often suffer from ( i ) the data insufficient problem, which makes it difficult to train the state of the arts (SOTAs) for domain-specific tasks, and ( ii ) the reproducibility problem, that existing models have not been thoroughly evaluated in a unified experimental setup. To address the issues, we develop a Benchmark Evaluation SysTem for Medical Visual Question Answering, denoted by BESTMVQA. Given clinical data, our system provides a useful tool for users to automatically build Med-VQA datasets. Users can conveniently select a wide spectrum of models from our library to perform a comprehensive evaluation study. With simple configurations, our system can automatically train and evaluate the selected models over a benchmark dataset, and reports the comprehensive results for users to develop new techniques or perform medical practice. Limitations of existing work are overcome ( i ) by the data generation tool, which automatically constructs new datasets from unstructured clinical data, and ( ii ) by evaluating SOTAs on benchmark datasets in a unified experimental setup. The demonstration video of our system can be found at https://youtu.be/QkEeFlu1x4A , and the source code is shared on https://github.com/emmali808/BESTMVQA .

Cite

Text

Hong et al. "BESTMVQA: A Benchmark Evaluation System for Medical Visual Question Answering." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024. doi:10.1007/978-3-031-70378-2_27

Markdown

[Hong et al. "BESTMVQA: A Benchmark Evaluation System for Medical Visual Question Answering." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024.](https://mlanthology.org/ecmlpkdd/2024/hong2024ecmlpkdd-bestmvqa/) doi:10.1007/978-3-031-70378-2_27

BibTeX

@inproceedings{hong2024ecmlpkdd-bestmvqa,
  title     = {{BESTMVQA: A Benchmark Evaluation System for Medical Visual Question Answering}},
  author    = {Hong, Xiaojie and Song, Zixin and Li, Liangzhi and Wang, Xiaoli and Liu, Feiyan},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2024},
  pages     = {435-451},
  doi       = {10.1007/978-3-031-70378-2_27},
  url       = {https://mlanthology.org/ecmlpkdd/2024/hong2024ecmlpkdd-bestmvqa/}
}