Pull-Back Geometry of Persistent Homology Encodings

Abstract

Persistent homology (PH) is a method for generating topology-inspired representations of data. Empirical studies that investigate the properties of PH, such as its sensitivity to perturbations or ability to detect a feature of interest, commonly rely on training and testing an additional model on the basis of the PH representation. To gain more intrinsic insights about PH, independently of the choice of such a model, we propose a novel methodology based on the pull-back geometry that a PH encoding induces on the data manifold. The spectrum and eigenvectors of the induced metric help to identify the most and least significant information captured by PH. Furthermore, the pull-back norm of tangent vectors provides insights about the sensitivity of PH to a given perturbation, or its potential to detect a given feature of interest, and in turn its ability to solve a given classification or regression problem. Experimentally, the insights gained through our methodology align well with the existing knowledge about PH. Moreover, we show that the pull-back norm correlates with the performance on downstream tasks, and can therefore guide the choice of a suitable PH encoding.

Cite

Text

Liang et al. "Pull-Back Geometry of Persistent Homology Encodings." Transactions on Machine Learning Research, 2024.

Markdown

[Liang et al. "Pull-Back Geometry of Persistent Homology Encodings." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/liang2024tmlr-pullback/)

BibTeX

@article{liang2024tmlr-pullback,
  title     = {{Pull-Back Geometry of Persistent Homology Encodings}},
  author    = {Liang, Shuang and Turkes, Renata and Li, Jiayi and Otter, Nina and Montufar, Guido},
  journal   = {Transactions on Machine Learning Research},
  year      = {2024},
  url       = {https://mlanthology.org/tmlr/2024/liang2024tmlr-pullback/}
}