Visual Question Answering for Peruvian Cuisine in Regional Spanish
Abstract
This project leverages Visual Question Answering (VQA) to promote Peruvian gastronomy by utilizing a culturally rich dataset and advanced models such as LLaVA-1.5 and GPT-2 Large. The evaluation will comprise both automated metrics and culinary expert assessments. This system addresses regional variations in dish names, promotes inclusivity by involving Peruvians from diverse regions in dataset construction, and enhances cultural representation.
Cite
Text
Cosavalente. "Visual Question Answering for Peruvian Cuisine in Regional Spanish." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I28.35339Markdown
[Cosavalente. "Visual Question Answering for Peruvian Cuisine in Regional Spanish." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/cosavalente2025aaai-visual/) doi:10.1609/AAAI.V39I28.35339BibTeX
@inproceedings{cosavalente2025aaai-visual,
title = {{Visual Question Answering for Peruvian Cuisine in Regional Spanish}},
author = {Cosavalente, Mariana Risco},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {29602-29604},
doi = {10.1609/AAAI.V39I28.35339},
url = {https://mlanthology.org/aaai/2025/cosavalente2025aaai-visual/}
}