CircuitVQA: A Visual Question Answering Dataset for Electrical Circuit Images
Abstract
A visual question answering (VQA) system for electrical circuit images could be useful as a quiz generator, design and verification assistant or an electrical diagnosis tool. Although there exists a vast literature on VQA, to the best of our knowledge, there is no existing work on VQA for electrical circuit images. To this end, we curate a new dataset, CircuitVQA , of 115K+ questions on 5725 electrical images with $\sim $ ∼ 70 circuit symbols. The dataset contains schematic as well as hand-drawn images. The questions span various categories like counting, value, junction and position based questions. To be effective, models must demonstrate skills like object detection, text recognition, spatial understanding, question intent understanding and answer generation. We experiment with multiple foundational visio-linguistic models for this task and find that a finetuned BLIP model with component descriptions as additional input provides best results. We make the code and dataset publicly available ( https://github.com/rahcode7/Circuit-VQA ).
Cite
Text
Mehta et al. "CircuitVQA: A Visual Question Answering Dataset for Electrical Circuit Images." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024. doi:10.1007/978-3-031-70341-6_26Markdown
[Mehta et al. "CircuitVQA: A Visual Question Answering Dataset for Electrical Circuit Images." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024.](https://mlanthology.org/ecmlpkdd/2024/mehta2024ecmlpkdd-circuitvqa/) doi:10.1007/978-3-031-70341-6_26BibTeX
@inproceedings{mehta2024ecmlpkdd-circuitvqa,
title = {{CircuitVQA: A Visual Question Answering Dataset for Electrical Circuit Images}},
author = {Mehta, Rahul and Singh, Bhavyajeet and Varma, Vasudeva and Gupta, Manish},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2024},
pages = {440-460},
doi = {10.1007/978-3-031-70341-6_26},
url = {https://mlanthology.org/ecmlpkdd/2024/mehta2024ecmlpkdd-circuitvqa/}
}