End-to-End Learning Framework for Solving Non-Markovian Optimal Control

Abstract

Integer-order calculus fails to capture the long-range dependence (LRD) and memory effects found in many complex systems. Fractional calculus addresses these gaps through fractional-order integrals and derivatives, but fractional-order dynamical systems pose substantial challenges in system identification and optimal control tasks. In this paper, we theoretically derive the optimal control via linear quadratic regulator (LQR) for fractional-order linear time-invariant (FOLTI) systems and develop an end-to-end deep learning framework based on this theoretical foundation. Our approach establishes a rigorous mathematical model, derives analytical solutions, and incorporates deep learning to achieve data-driven optimal control of FOLTI systems. Our key contributions include: (i) proposing a novel method for system identification and optimal control strategy in FOLTI systems, (ii) developing the first end-to-end data-driven learning framework, Fractional-Order Learning for Optimal Control (FOLOC), that learns control policies from observed trajectories, and (iii) deriving theoretical bounds on the sample complexity for learning accurate control policies under fractional-order dynamics. Experimental results indicate that our method accurately approximates fractional-order system behaviors without relying on Gaussian noise assumptions, pointing to promising avenues for advanced optimal control.

Cite

Text

Zhang et al. "End-to-End Learning Framework for Solving Non-Markovian Optimal Control." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Zhang et al. "End-to-End Learning Framework for Solving Non-Markovian Optimal Control." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/zhang2025icml-endtoend/)

BibTeX

@inproceedings{zhang2025icml-endtoend,
  title     = {{End-to-End Learning Framework for Solving Non-Markovian Optimal Control}},
  author    = {Zhang, Xiaole and Zhang, Peiyu and Xiao, Xiongye and Li, Shixuan and Tzoumas, Vasileios and Gupta, Vijay and Bogdan, Paul},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {76966-76997},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/zhang2025icml-endtoend/}
}