FungiTastic: A Multi-Modal Dataset and Benchmark for Image Categorization
Abstract
We introduce a new, challenging benchmark and a dataset, FungiTastic, based on fungal records continuously collected over a twenty-year span. The dataset is labelled and curated by experts and consists of about 350k multimodal observations of 5k fine-grained categories (species). The fungi observations include photographs and additional data, e.g., meteorological and climatic data, satellite images, and body part segmentation masks. FungiTastic is one of the few benchmarks that include a test set with DNA-sequenced ground truth of unprecedented label reliability. The benchmark is designed to support (i) standard closed-set classification, (ii) open-set classification, (iii) multi-modal classification, (iv) few-shot learning, (v) domain shift, and many more. We provide tailored baselines for many use cases, a multitude of ready-to-use pre-trained models on HuggingFace, and a framework for model training. The documentation and the baselines are available at GitHub and Kaggle.
Cite
Text
Picek et al. "FungiTastic: A Multi-Modal Dataset and Benchmark for Image Categorization." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.Markdown
[Picek et al. "FungiTastic: A Multi-Modal Dataset and Benchmark for Image Categorization." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/picek2025cvprw-fungitastic/)BibTeX
@inproceedings{picek2025cvprw-fungitastic,
title = {{FungiTastic: A Multi-Modal Dataset and Benchmark for Image Categorization}},
author = {Picek, Lukás and Janousková, Klára and Cermák, Vojtech and Matas, Jiri},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2025},
pages = {2046-2056},
url = {https://mlanthology.org/cvprw/2025/picek2025cvprw-fungitastic/}
}