Visual Question Answering for Peruvian Cuisine in Regional Spanish

Abstract

This project leverages Visual Question Answering (VQA) to promote Peruvian gastronomy by utilizing a culturally rich dataset and advanced models such as LLaVA-1.5 and GPT-2 Large. The evaluation will comprise both automated metrics and culinary expert assessments. This system addresses regional variations in dish names, promotes inclusivity by involving Peruvians from diverse regions in dataset construction, and enhances cultural representation.

Cite

Text

Cosavalente. "Visual Question Answering for Peruvian Cuisine in Regional Spanish." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I28.35339

Markdown

[Cosavalente. "Visual Question Answering for Peruvian Cuisine in Regional Spanish." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/cosavalente2025aaai-visual/) doi:10.1609/AAAI.V39I28.35339

BibTeX

@inproceedings{cosavalente2025aaai-visual,
  title     = {{Visual Question Answering for Peruvian Cuisine in Regional Spanish}},
  author    = {Cosavalente, Mariana Risco},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {29602-29604},
  doi       = {10.1609/AAAI.V39I28.35339},
  url       = {https://mlanthology.org/aaai/2025/cosavalente2025aaai-visual/}
}