DermaCon-in: A Multiconcept-Annotated Dermatological Image Dataset of Indian Skin Disorders for Clinical AI Research

Abstract

Artificial intelligence is poised to augment dermatological care by enabling scalable image-based diagnostics. Yet, the development of robust and equitable models remains hindered by datasets that fail to capture the clinical and demographic complexity of real-world practice. This complexity stems from region-specific disease distributions, wide variation in skin tones, and the underrepresentation of outpatient scenarios from non-Western populations. We introduce DermaCon-IN, a prospectively curated dermatology dataset comprising 5,450 clinical images from 3,002 patients across outpatient clinics in South India. Each image is annotated by board-certified dermatologists with 245 distinct diagnoses, structured under a hierarchical, etiology-based taxonomy adapted from Rook’s classification. The dataset captures a wide spectrum of dermatologic conditions and tonal variation commonly seen in Indian outpatient care. We benchmark a range of architectures, including convolutional models (ResNet, DenseNet, EfficientNet), transformer-based models (ViT, MaxViT, Swin), and Concept Bottleneck Models to establish baseline performance and explore how anatomical and concept-level cues may be integrated. These results are intended to guide future efforts toward interpretable and clinically realistic models. DermaCon-IN provides a scalable and representative foundation for advancing dermatology AI in real-world settings.

Cite

Text

Madarkar et al. "DermaCon-in: A Multiconcept-Annotated Dermatological Image Dataset of Indian Skin Disorders for Clinical AI Research." Advances in Neural Information Processing Systems, 2025.

Markdown

[Madarkar et al. "DermaCon-in: A Multiconcept-Annotated Dermatological Image Dataset of Indian Skin Disorders for Clinical AI Research." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/madarkar2025neurips-dermaconin/)

BibTeX

@inproceedings{madarkar2025neurips-dermaconin,
  title     = {{DermaCon-in: A Multiconcept-Annotated Dermatological Image Dataset of Indian Skin Disorders for Clinical AI Research}},
  author    = {Madarkar, Shanawaj S and Madarkar, Mahajabeen and V, Madhumitha and Prakash, Teli and Mopuri, Konda Reddy and Mv, Vinaykumar and Sathwika, Kota Venkata Lakshmi and Kasturi, Adarsh and Raj, Gandla Dilip and Supranitha, Padharthi Venkata Naga and Udai, Harsh},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/madarkar2025neurips-dermaconin/}
}