Towards a Defense Against Federated Backdoor Attacks Under Continuous Training
Abstract
Backdoor attacks are dangerous and difficult to prevent in federated learning (FL), where training data is sourced from untrusted clients over long periods of time. These difficulties arise because: (a) defenders in FL do not have access to raw training data, and (b) a phenomenon we identify called backdoor leakage causes models trained continuously to eventually suffer from backdoors due to cumulative errors in defense mechanisms. We propose a framework called shadow learning for defending against backdoor attacks in the FL setting under long-range training. Shadow learning trains two models in parallel: a backbone model and a shadow model. The backbone is trained without any defense mechanism to obtain good performance on the main task. The shadow model combines filtering of malicious clients with early-stopping to control the attack success rate even as the data distribution changes. We theoretically motivate our design and show experimentally that our framework significantly improves upon existing defenses against backdoor attacks.
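The sketch below illustrates the round structure described in the abstract: a backbone model aggregates every client update to preserve main-task performance, while a shadow model aggregates only updates that pass a malicious-client filter and is early-stopped to control the attack success rate. This is a minimal illustration, not the authors' implementation; the helper names (filter_clients, aggregate, should_early_stop) are hypothetical plug-in components.

def shadow_learning_round(backbone, shadow, client_updates,
                          filter_clients, aggregate, should_early_stop):
    """One federated round of shadow learning (illustrative sketch).

    `filter_clients`, `aggregate`, and `should_early_stop` are assumed
    placeholders for the filtering, aggregation, and early-stopping
    mechanisms, not a fixed API from the paper.
    """
    # Backbone: no defense; aggregate all client updates for main-task accuracy.
    backbone = aggregate(backbone, client_updates)

    # Shadow: keep only the updates the filter marks as benign.
    benign_updates = filter_clients(client_updates)

    # Early stopping bounds backdoor influence even as the data distribution
    # drifts: once the criterion fires, the shadow model is frozen this round.
    if not should_early_stop(shadow):
        shadow = aggregate(shadow, benign_updates)

    return backbone, shadow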
Cite
Text
Wang et al. "Towards a Defense Against Federated Backdoor Attacks Under Continuous Training." Transactions on Machine Learning Research, 2023.
Markdown
[Wang et al. "Towards a Defense Against Federated Backdoor Attacks Under Continuous Training." Transactions on Machine Learning Research, 2023.](https://mlanthology.org/tmlr/2023/wang2023tmlr-defense/)
BibTeX
@article{wang2023tmlr-defense,
title = {{Towards a Defense Against Federated Backdoor Attacks Under Continuous Training}},
author = {Wang, Shuaiqi and Hayase, Jonathan and Fanti, Giulia and Oh, Sewoong},
journal = {Transactions on Machine Learning Research},
year = {2023},
url = {https://mlanthology.org/tmlr/2023/wang2023tmlr-defense/}
}