A Noise-Corrected Langevin Algorithm and Sampling by Half-Denoising
Abstract
The Langevin algorithm is a classic method for sampling from a given pdf in a real space. In its basic version, it only requires knowledge of the gradient of the log-density, also called the score function. However, in deep learning, it is often easier to learn the so-called "noisy-data score function", i.e. the gradient of the log-density of noisy data, more precisely when Gaussian noise is added to the data. Such an estimate is biased and complicates the use of the Langevin method. Here, we propose a noise-corrected version of the Langevin algorithm, where the bias due to noisy data is removed, at least regarding first-order terms. Unlike diffusion models, our algorithm only needs to know the noisy-data score function for one single noise level. We further propose a simple special case which has an interesting intuitive interpretation of iteratively adding noise the data and then attempting to remove half of that noise.
Cite
Text
Hyvarinen. "A Noise-Corrected Langevin Algorithm and Sampling by Half-Denoising." Transactions on Machine Learning Research, 2025.Markdown
[Hyvarinen. "A Noise-Corrected Langevin Algorithm and Sampling by Half-Denoising." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/hyvarinen2025tmlr-noisecorrected/)BibTeX
@article{hyvarinen2025tmlr-noisecorrected,
title = {{A Noise-Corrected Langevin Algorithm and Sampling by Half-Denoising}},
author = {Hyvarinen, Aapo},
journal = {Transactions on Machine Learning Research},
year = {2025},
url = {https://mlanthology.org/tmlr/2025/hyvarinen2025tmlr-noisecorrected/}
}