Local Topological Data Analysis to Uncover the Global Structure of Data Approaching Graph-Structured Topologies
Abstract
Gene expression data of differentiating cells, galaxies distributed in space, and earthquake locations, all share a common property: they lie close to a graph-structured topology in their respective spaces [ 1 , 4 , 9 , 10 , 20 ], referred to as one-dimensional stratified spaces in mathematics. Often, the uncovering of such topologies offers great insight into these data sets. However, methods for dimensionality reduction are clearly inappropriate for this purpose, and also methods from the relatively new field of Topological Data Analysis (TDA) are inappropriate, due to noise sensitivity, computational complexity, or other limitations. In this paper we introduce a new method, termed Local TDA (LTDA ), which resolves the issues of pre-existing methods by unveiling ( global ) graph-structured topologies in data by means of robust and computationally cheap local analyses. Our method rests on a simple graph-theoretic result that enables one to identify isolated, end-, edge- and multifurcation points in the topology underlying the data. It then uses this information to piece together a graph that is homeomorphic to the unknown one-dimensional stratified space underlying the point cloud data. We evaluate our method on a number of artificial and real-life data sets, demonstrating its superior effectiveness, robustness against noise, and scalability. Code related to this paper is available at: https://bitbucket.org/ghentdatascience/gltda-public .
Cite
Text
Vandaele et al. "Local Topological Data Analysis to Uncover the Global Structure of Data Approaching Graph-Structured Topologies." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018. doi:10.1007/978-3-030-10928-8_2Markdown
[Vandaele et al. "Local Topological Data Analysis to Uncover the Global Structure of Data Approaching Graph-Structured Topologies." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018.](https://mlanthology.org/ecmlpkdd/2018/vandaele2018ecmlpkdd-local/) doi:10.1007/978-3-030-10928-8_2BibTeX
@inproceedings{vandaele2018ecmlpkdd-local,
title = {{Local Topological Data Analysis to Uncover the Global Structure of Data Approaching Graph-Structured Topologies}},
author = {Vandaele, Robin and De Bie, Tijl and Saeys, Yvan},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2018},
pages = {19-36},
doi = {10.1007/978-3-030-10928-8_2},
url = {https://mlanthology.org/ecmlpkdd/2018/vandaele2018ecmlpkdd-local/}
}