LipsNet++: Unifying Filter and Controller into a Policy Network
Abstract
Deep reinforcement learning (RL) is effective for decision-making and control tasks like autonomous driving and embodied AI. However, RL policies often suffer from the action fluctuation problem in real-world applications, resulting in severe actuator wear, safety risk, and performance degradation. This paper identifies the two fundamental causes of action fluctuation: observation noise and policy non-smoothness. We propose LipsNet++, a novel policy network with Fourier filter layer and Lipschitz controller layer to separately address both causes. The filter layer incorporates a trainable filter matrix that automatically extracts important frequencies while suppressing noise frequencies in the observations. The controller layer introduces a Jacobian regularization technique to achieve a low Lipschitz constant, ensuring smooth fitting of a policy function. These two layers function analogously to the filter and controller in classical control theory, suggesting that filtering and control capabilities can be seamlessly integrated into a single policy network. Both simulated and real-world experiments demonstrate that LipsNet++ achieves the state-of-the-art noise robustness and action smoothness. The code and videos are publicly available at https://xjsong99.github.io/LipsNet_v2.
Cite
Text
Song et al. "LipsNet++: Unifying Filter and Controller into a Policy Network." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Song et al. "LipsNet++: Unifying Filter and Controller into a Policy Network." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/song2025icml-lipsnet/)BibTeX
@inproceedings{song2025icml-lipsnet,
title = {{LipsNet++: Unifying Filter and Controller into a Policy Network}},
author = {Song, Xujie and Chen, Liangfa and Liu, Tong and Wang, Wenxuan and Wang, Yinuo and Qin, Shentao and Ma, Yinsong and Duan, Jingliang and Li, Shengbo Eben},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {56204-56241},
volume = {267},
url = {https://mlanthology.org/icml/2025/song2025icml-lipsnet/}
}