Towards Natural Machine Unlearning

He, Zhengbao; Li, Tao; Cheng, Xinwen; Huang, Zhehao; Huang, Xiaolin

Towards Natural Machine Unlearning

Zhengbao He, Tao Li, Xinwen Cheng, Zhehao Huang, Xiaolin Huang

NeurIPSW 2024

/neuripsw/2024/he2024neuripsw-natural/

Abstract

Machine unlearning (MU) aims to eliminate information that has been learned from specific training data, namely forgetting data, from a pre-trained model. Currently, the mainstream of existing MU methods involves modifying the forgetting data with incorrect labels and subsequently fine-tuning the model. While learning such incorrect information can indeed remove knowledge, the process is quite unnatural as the unlearning process undesirably reinforces the incorrect information and leads to over-forgetting. Towards more *natural* machine unlearning, we inject correct information from the remaining data to the forgetting samples when changing their labels. Through pairing these adjusted samples with their labels, the model will tend to use the injected correct information and naturally suppress the information meant to be forgotten. Albeit straightforward, such a first step towards natural machine unlearning can significantly outperform current state-of-the-art approaches. In particular, our method substantially reduces the over-forgetting and leads to stable performance in various unlearning settings, making it a promising candidate for practical machine unlearning.

PDF NeurIPSW OpenReview Semantic Scholar

Cite

Text

He et al. "Towards Natural Machine Unlearning." NeurIPS 2024 Workshops: FITML, 2024.

Markdown

[He et al. "Towards Natural Machine Unlearning." NeurIPS 2024 Workshops: FITML, 2024.](https://mlanthology.org/neuripsw/2024/he2024neuripsw-natural/)

BibTeX

@inproceedings{he2024neuripsw-natural,
  title     = {{Towards Natural Machine Unlearning}},
  author    = {He, Zhengbao and Li, Tao and Cheng, Xinwen and Huang, Zhehao and Huang, Xiaolin},
  booktitle = {NeurIPS 2024 Workshops: FITML},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/he2024neuripsw-natural/}
}