Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flips
Abstract
Deep Neural Networks (DNNs) can be catastrophically disrupted by flipping only a handful of sign bits in their parameters. We introduce Deep Neural Lesion (DNL), a data-free, lightweight method that locates these critical parameters and triggers massive accuracy drops. We validate its efficacy on a wide variety of computer vision models and datasets. The method requires no training data or optimization and can be carried out via common exploits software, firmware or hardware based attack vectors. An enhanced variant that uses a single forward and backward pass further amplifies the damage beyond DNL's zero-pass approach. Flipping just two sign bits in ResNet50 on ImageNet reduces accuracy by 99.8%. We also show that selectively protecting a small fraction of vulnerable sign bits provides a practical defense against such attacks.
Cite
Text
Galil et al. "Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flips." Transactions on Machine Learning Research, 2026.Markdown
[Galil et al. "Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flips." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/galil2026tmlr-maximal/)BibTeX
@article{galil2026tmlr-maximal,
title = {{Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flips}},
author = {Galil, Ido and Kimhi, Moshe and El-Yaniv, Ran},
journal = {Transactions on Machine Learning Research},
year = {2026},
url = {https://mlanthology.org/tmlr/2026/galil2026tmlr-maximal/}
}