Deep Classifier Mimicry Without Data Access

Abstract

Access to pre-trained models has recently emerged as a standard across numerous machine learning domains. Unfortunately, access to the original data the models were trained on may not equally be granted. This makes it tremendously challenging to fine-tune, compress models, adapt continually, or to do any other type of data-driven update. We posit that original data access may however not be required. Specifically, we propose Contrastive Abductive Knowledge Extraction (CAKE), a model-agnostic knowledge distillation procedure that mimics deep classifiers without access to the original data. To this end, CAKE generates pairs of noisy synthetic samples and diffuses them contrastively toward a model’s decision boundary. We empirically corroborate CAKE’s effectiveness using several benchmark datasets and various architectural choices, paving the way for broad application.

Cite

Text

Braun et al. "Deep Classifier Mimicry Without Data Access." Artificial Intelligence and Statistics, 2024.

Markdown

[Braun et al. "Deep Classifier Mimicry Without Data Access." Artificial Intelligence and Statistics, 2024.](https://mlanthology.org/aistats/2024/braun2024aistats-deep/)

BibTeX

@inproceedings{braun2024aistats-deep,
  title     = {{Deep Classifier Mimicry Without Data Access}},
  author    = {Braun, Steven and Mundt, Martin and Kersting, Kristian},
  booktitle = {Artificial Intelligence and Statistics},
  year      = {2024},
  pages     = {4762-4770},
  volume    = {238},
  url       = {https://mlanthology.org/aistats/2024/braun2024aistats-deep/}
}