Watermarking for Out-of-Distribution Detection

Abstract

Out-of-distribution (OOD) detection aims to identify OOD data based on representations extracted from well-trained deep models. However, existing methods largely ignore the reprogramming property of deep models and thus may not fully unleash their intrinsic strength: without modifying parameters of a well-trained deep model, we can reprogram this model for a new purpose via data-level manipulation (e.g., adding a specific feature perturbation). This property motivates us to reprogram a classification model to excel at OOD detection (a new task), and thus we propose a general methodology named watermarking in this paper. Specifically, we learn a unified pattern that is superimposed onto features of original data, and the model's detection capability is largely boosted after watermarking. Extensive experiments verify the effectiveness of watermarking, demonstrating the significance of the reprogramming property of deep models in OOD detection.

Cite

Text

Wang et al. "Watermarking for Out-of-Distribution Detection." Neural Information Processing Systems, 2022.

Markdown

[Wang et al. "Watermarking for Out-of-Distribution Detection." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/wang2022neurips-watermarking/)

BibTeX

@inproceedings{wang2022neurips-watermarking,
  title     = {{Watermarking for Out-of-Distribution Detection}},
  author    = {Wang, Qizhou and Liu, Feng and Zhang, Yonggang and Zhang, Jing and Gong, Chen and Liu, Tongliang and Han, Bo},
  booktitle = {Neural Information Processing Systems},
  year      = {2022},
  url       = {https://mlanthology.org/neurips/2022/wang2022neurips-watermarking/}
}