Radioactive Data: Tracing Through Training

Abstract

Data tracing determines whether particular data samples have been used to train a model. We propose a new technique, radioactive data, that makes imperceptible changes to these samples such that any model trained on them will bear an identifiable mark. Given a trained model, our technique detects the use of radioactive data and provides a level of confidence (p-value). Experiments on large-scale benchmarks (Imagenet), with standard architectures (Resnet-18, VGG-16, Densenet-121) and training procedures, show that we detect radioactive data with high confidence (p<0.0001) when only 1% of the data used to train a model is radioactive. Our radioactive mark is resilient to strong data augmentations and variations of the model architecture. As a result, it offers a much higher signal-to-noise ratio than data poisoning and backdoor methods.

Cite

Text

Sablayrolles et al. "Radioactive Data: Tracing Through Training." International Conference on Machine Learning, 2020.

Markdown

[Sablayrolles et al. "Radioactive Data: Tracing Through Training." International Conference on Machine Learning, 2020.](https://mlanthology.org/icml/2020/sablayrolles2020icml-radioactive/)

BibTeX

@inproceedings{sablayrolles2020icml-radioactive,
  title     = {{Radioactive Data: Tracing Through Training}},
  author    = {Sablayrolles, Alexandre and Douze, Matthijs and Schmid, Cordelia and Jegou, Herve},
  booktitle = {International Conference on Machine Learning},
  year      = {2020},
  pages     = {8326-8335},
  volume    = {119},
  url       = {https://mlanthology.org/icml/2020/sablayrolles2020icml-radioactive/}
}