A Truncated Newton Method for Optimal Transport
Abstract
Developing a contemporary optimal transport (OT) solver requires navigating trade-offs among several critical requirements: GPU parallelization, scalability to high-dimensional problems, theoretical convergence guarantees, empirical performance in terms of precision versus runtime, and numerical stability in practice. With these challenges in mind, we introduce a specialized truncated Newton algorithm for entropic-regularized OT. In addition to proving that locally quadratic convergence is possible without assuming a Lipschitz Hessian, we provide strategies to maximally exploit the high rate of local convergence in practice. Our GPU-parallel algorithm exhibits exceptionally favorable runtime performance, achieving high precision orders of magnitude faster than many existing alternatives. This is evidenced by wall-clock time experiments on 24 problem sets (12 datasets $\times$ 2 cost functions). The scalability of the algorithm is showcased on an extremely large OT problem with $n \approx 10^6$, solved approximately under weak entropic regularization.
Cite
Text
Kemertas et al. "A Truncated Newton Method for Optimal Transport." International Conference on Learning Representations, 2025.Markdown
[Kemertas et al. "A Truncated Newton Method for Optimal Transport." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/kemertas2025iclr-truncated/)BibTeX
@inproceedings{kemertas2025iclr-truncated,
title = {{A Truncated Newton Method for Optimal Transport}},
author = {Kemertas, Mete and Farahmand, Amir-massoud and Jepson, Allan Douglas},
booktitle = {International Conference on Learning Representations},
year = {2025},
url = {https://mlanthology.org/iclr/2025/kemertas2025iclr-truncated/}
}