Manifold Denoising as Preprocessing for Finding Natural Representations of Data
Abstract
A natural representation of data are the parameters which generated the data. If the parameter space is continuous we can regard it as a manifold. In practice we usually do not know this manifold but we just\nhave some representation of the data, often in a very high-dimensional feature space. Since the number of internal parameters does not\nchange with the representation, the data will effectively lie on a low-dimensional submanifold in feature space. Due to measurement errors this data is usually corrupted by noise which particularly in high-dimensional feature spaces makes it almost impossible to find the manifold structure.\nThis paper reviews a method called Manifold Denoising which projects\nthe data onto the submanifold using a diffusion process on a graph generated by the data. We will demonstrate\nthat the method is capable of dealing with non-trival high-dimensional noise. Moreover we will show that using\nthe method as a preprocessing step one can significantly improve the results of a semi-supervised learning algorithm.
Cite
Text
Hein and Maier. "Manifold Denoising as Preprocessing for Finding Natural Representations of Data." AAAI Conference on Artificial Intelligence, 2007.Markdown
[Hein and Maier. "Manifold Denoising as Preprocessing for Finding Natural Representations of Data." AAAI Conference on Artificial Intelligence, 2007.](https://mlanthology.org/aaai/2007/hein2007aaai-manifold/)BibTeX
@inproceedings{hein2007aaai-manifold,
title = {{Manifold Denoising as Preprocessing for Finding Natural Representations of Data}},
author = {Hein, Matthias and Maier, Markus},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2007},
pages = {1646-1649},
url = {https://mlanthology.org/aaai/2007/hein2007aaai-manifold/}
}