Detection of Man-in-the-Middle Attacks in Model-Free Reinforcement Learning

Abstract

This paper proposes a Bellman Deviation algorithm for the detection of man-in-the-middle (MITM) attacks occurring when an agent controls a Markov Decision Process (MDP) system using model-free reinforcement learning. This algorithm is derived by constructing a "Bellman Deviation sequence" and finding stochastic bounds on its running sequence average. We show that an intuitive, necessary and sufficient "informational advantage" condition must be met for the proposed algorithm to guarantee the detection of attacks with high probability, while also avoiding false alarms.

Cite

Text

Rani and Franceschetti. "Detection of Man-in-the-Middle Attacks in Model-Free Reinforcement Learning." Proceedings of The 5th Annual Learning for Dynamics and Control Conference, 2023.

Markdown

[Rani and Franceschetti. "Detection of Man-in-the-Middle Attacks in Model-Free Reinforcement Learning." Proceedings of The 5th Annual Learning for Dynamics and Control Conference, 2023.](https://mlanthology.org/l4dc/2023/rani2023l4dc-detection/)

BibTeX

@inproceedings{rani2023l4dc-detection,
  title     = {{Detection of Man-in-the-Middle Attacks in Model-Free Reinforcement Learning}},
  author    = {Rani, Rishi and Franceschetti, Massimo},
  booktitle = {Proceedings of The 5th Annual Learning for Dynamics and Control Conference},
  year      = {2023},
  pages     = {993-1007},
  volume    = {211},
  url       = {https://mlanthology.org/l4dc/2023/rani2023l4dc-detection/}
}