Roll-Drop: Accounting for Observation Noise with a Single Parameter

Abstract

This paper proposes a simple strategy for sim-to-real in Deep-Reinforcement Learning (DRL) – called Roll-Drop – that uses dropout during simulation to account for observation noise during deployment without explicitly modelling its distribution for each state. DRL is a promising approach to control robots for highly dynamic and feedback-based manoeuvres, and accurate simulators are crucial to providing cheap and abundant data to learn the desired behaviour. Nevertheless, the simulated data are noiseless and generally show a distributional shift that challenges the deployment on real machines where sensor readings are affected by noise. The standard solution is modelling the latter and injecting it during training; while this requires a thorough system identification, Roll-Drop enhances the robustness to sensor noise by tuning only a single parameter. We demonstrate an 80% success rate when up to 25% noise is injected in the observations, with twice higher robustness than the baselines. We deploy the controller trained in simulation on a Unitree A1 platform and assess this improved robustness on the physical system. Additional resources at: https://sites.google.com/oxfordrobotics.institute/roll-drop

Cite

Text

Campanaro et al. "Roll-Drop: Accounting for Observation Noise with a Single Parameter." Proceedings of The 5th Annual Learning for Dynamics and Control Conference, 2023.

Markdown

[Campanaro et al. "Roll-Drop: Accounting for Observation Noise with a Single Parameter." Proceedings of The 5th Annual Learning for Dynamics and Control Conference, 2023.](https://mlanthology.org/l4dc/2023/campanaro2023l4dc-rolldrop/)

BibTeX

@inproceedings{campanaro2023l4dc-rolldrop,
  title     = {{Roll-Drop: Accounting for Observation Noise with a Single Parameter}},
  author    = {Campanaro, Luigi and De Martini, Daniele and Gangapurwala, Siddhant and Merkt, Wolfgang and Havoutis, Ioannis},
  booktitle = {Proceedings of The 5th Annual Learning for Dynamics and Control Conference},
  year      = {2023},
  pages     = {718-730},
  volume    = {211},
  url       = {https://mlanthology.org/l4dc/2023/campanaro2023l4dc-rolldrop/}
}