Deep k-NN Defense Against Clean-Label Data Poisoning Attacks

Abstract

Targeted clean-label data poisoning is a type of adversarial attack on machine learning systems in which an adversary injects a few correctly-labeled, minimally-perturbed samples into the training data, causing a model to misclassify a particular test sample during inference. Although defenses have been proposed for general poisoning attacks, no reliable defense for clean-label attacks has been demonstrated, despite the attacks’ effectiveness and realistic applications. In this work, we propose a simple, yet highly-effective Deep k -NN defense against both feature collision and convex polytope clean-label attacks on the CIFAR-10 dataset. We demonstrate that our proposed strategy is able to detect over 99% of poisoned examples in both attacks and remove them without compromising model performance. Additionally, through ablation studies, we discover simple guidelines for selecting the value of k as well as for implementing the Deep k -NN defense on real-world datasets with class imbalance. Our proposed defense shows that current clean-label poisoning attack strategies can be annulled, and serves as a strong yet simple-to-implement baseline defense to test future clean-label poisoning attacks. Our code is available on GitHub .

Cite

Text

Peri et al. "Deep k-NN Defense Against Clean-Label Data Poisoning Attacks." European Conference on Computer Vision Workshops, 2020. doi:10.1007/978-3-030-66415-2_4

Markdown

[Peri et al. "Deep k-NN Defense Against Clean-Label Data Poisoning Attacks." European Conference on Computer Vision Workshops, 2020.](https://mlanthology.org/eccvw/2020/peri2020eccvw-deep/) doi:10.1007/978-3-030-66415-2_4

BibTeX

@inproceedings{peri2020eccvw-deep,
  title     = {{Deep k-NN Defense Against Clean-Label Data Poisoning Attacks}},
  author    = {Peri, Neehar and Gupta, Neal and Huang, W. Ronny and Fowl, Liam and Zhu, Chen and Feizi, Soheil and Goldstein, Tom and Dickerson, John P.},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2020},
  pages     = {55-70},
  doi       = {10.1007/978-3-030-66415-2_4},
  url       = {https://mlanthology.org/eccvw/2020/peri2020eccvw-deep/}
}