Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference
Abstract
Solving stochastic optimal control problems with quadratic control costs can be viewed as approximating a target path space measure, e.g. via gradient-based optimization. In practice, however, this optimization is challenging in particular if the target measure differs substantially from the prior. In this work, we therefore approach the problem by iteratively solving constrained problems incorporating trust regions that aim for approaching the target measure gradually in a systematic way. It turns out that this trust region based strategy can be understood as a geometric annealing from the prior to the target measure, where, however, the incorporated trust regions lead to a principled and educated way of choosing the time steps in the annealing path. We demonstrate in multiple optimal control applications that our novel method can improve performance significantly, including tasks in diffusion-based sampling and fine-tuning of diffusion models.
Cite
Text
Blessing et al. "Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference." Advances in Neural Information Processing Systems, 2025.Markdown
[Blessing et al. "Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/blessing2025neurips-trust/)BibTeX
@inproceedings{blessing2025neurips-trust,
title = {{Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference}},
author = {Blessing, Denis and Berner, Julius and Richter, Lorenz and Domingo-Enrich, Carles and Du, Yuanqi and Vahdat, Arash and Neumann, Gerhard},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/blessing2025neurips-trust/}
}