Summarizing Data Structures with Gaussian Process and Robust Neighborhood Preservation
Abstract
Latent variable models summarize high-dimensional data while preserving its many complex properties. This paper proposes a locality-aware and low-rank approximated Gaussian process latent variable model (LolaGP) that can preserve the global relationship and local geometry in the derivation of the latent variables. We realize the global relationship by imitating the sample similarity non-linearly and the local geometry based on our newly constructed neighborhood graph. Formally, we derive LolaGP from GP-LVM and implement a locality-aware regularization to reflect its adjacency relationship. The neighborhood graph is constructed based on the latent variables, making the local preservation more resistant to noise disruption and the curse of dimensionality than the previous methods that directly construct it from the high-dimensional data. Furthermore, we introduce a new lower bound of a log-posterior distribution based on low-rank matrix approximation, which allows LolaGP to handle larger datasets than the conventional GP-LVM extensions. Our contribution is to preserve both the global and local structures in the derivation of the latent variables using the robust neighborhood graph and introduce the scalable lower bound of the log-posterior distribution. We conducted an experimental analysis using synthetic as well as images with and without highly noise disrupted datasets. From both qualitative and quantitative standpoint, our method produced successful results in all experimental settings.
Cite
Text
Watanabe et al. "Summarizing Data Structures with Gaussian Process and Robust Neighborhood Preservation." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022. doi:10.1007/978-3-031-26419-1_10Markdown
[Watanabe et al. "Summarizing Data Structures with Gaussian Process and Robust Neighborhood Preservation." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022.](https://mlanthology.org/ecmlpkdd/2022/watanabe2022ecmlpkdd-summarizing/) doi:10.1007/978-3-031-26419-1_10BibTeX
@inproceedings{watanabe2022ecmlpkdd-summarizing,
title = {{Summarizing Data Structures with Gaussian Process and Robust Neighborhood Preservation}},
author = {Watanabe, Koshi and Maeda, Keisuke and Ogawa, Takahiro and Haseyama, Miki},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2022},
pages = {157-173},
doi = {10.1007/978-3-031-26419-1_10},
url = {https://mlanthology.org/ecmlpkdd/2022/watanabe2022ecmlpkdd-summarizing/}
}