Actor-Critic Methods Using Physics-Informed Neural Networks: Control of a 1d PDE Model for Fluid-Cooled Battery Packs

Abstract

This paper proposes an actor-critic algorithm for controlling the temperature of a battery pack using a cooling fluid. This is modeled by a coupled 1D partial differential equation (PDE) with a controlled advection term that determines the speed of the cooling fluid. The Hamilton-Jacobi-Bellman (HJB) equation is a PDE that evaluates the optimality of the value function and determines an optimal controller. We propose an algorithm that treats the value network as a Physics-Informed Neural Network (PINN) to solve the continuous-time HJB equation rather than a discrete-time Bellman optimality equation, and we derive a control function from the HJB equation. Our experiments show that a hybrid-policy method that updates the value network using the HJB equation and updates the policy network identically to PPO achieves the best results in the control of this PDE system.

Cite

Text

Mukherjee and Liu. "Actor-Critic Methods Using Physics-Informed Neural Networks: Control of a 1d PDE Model for Fluid-Cooled Battery Packs." ICML 2023 Workshops: Frontiers4LCD, 2023.

Markdown

[Mukherjee and Liu. "Actor-Critic Methods Using Physics-Informed Neural Networks: Control of a 1d PDE Model for Fluid-Cooled Battery Packs." ICML 2023 Workshops: Frontiers4LCD, 2023.](https://mlanthology.org/icmlw/2023/mukherjee2023icmlw-actorcritic/)

BibTeX

@inproceedings{mukherjee2023icmlw-actorcritic,
  title     = {{Actor-Critic Methods Using Physics-Informed Neural Networks: Control of a 1d PDE Model for Fluid-Cooled Battery Packs}},
  author    = {Mukherjee, Amartya and Liu, Jun},
  booktitle = {ICML 2023 Workshops: Frontiers4LCD},
  year      = {2023},
  url       = {https://mlanthology.org/icmlw/2023/mukherjee2023icmlw-actorcritic/}
}