EvoControl: Multi-Frequency Bi-Level Control for High-Frequency Continuous Control

Abstract

High-frequency control in continuous action and state spaces is essential for practical applications in the physical world. Directly applying end-to-end reinforcement learning to high-frequency control tasks struggles with assigning credit to actions across long temporal horizons, compounded by the difficulty of efficient exploration. The alternative, learning low-frequency policies that guide higher-frequency controllers (e.g., proportional-derivative (PD) controllers), can result in a limited total expressiveness of the combined control system, hindering overall performance. We introduce EvoControl, a novel bi-level policy learning framework for learning both a slow high-level policy (using PPO) and a fast low-level policy (using Evolution Strategies) for solving continuous control tasks. Learning with Evolution Strategies for the lower-policy allows robust learning for long horizons that crucially arise when operating at higher frequencies. This enables EvoControl to learn to control interactions at a high frequency, benefitting from more efficient exploration and credit assignment than direct high-frequency torque control without the need to hand-tune PD parameters. We empirically demonstrate that EvoControl can achieve a higher evaluation reward for continuous-control tasks compared to existing approaches, specifically excelling in tasks where high-frequency control is needed, such as those requiring safety-critical fast reactions.

Cite

Text

Holt et al. "EvoControl: Multi-Frequency Bi-Level Control for High-Frequency Continuous Control." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Holt et al. "EvoControl: Multi-Frequency Bi-Level Control for High-Frequency Continuous Control." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/holt2025icml-evocontrol/)

BibTeX

@inproceedings{holt2025icml-evocontrol,
  title     = {{EvoControl: Multi-Frequency Bi-Level Control for High-Frequency Continuous Control}},
  author    = {Holt, Samuel and Davchev, Todor and Tirumala, Dhruva and Moran, Ben and Iscen, Atil and Laurens, Antoine and Lin, Yixin and Frey, Erik and Wulfmeier, Markus and Romano, Francesco and Heess, Nicolas},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {23451-23512},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/holt2025icml-evocontrol/}
}