A Noise-Corrected Langevin Algorithm and Sampling by Half-Denoising

TMLR 2025

/tmlr/2025/hyvarinen2025tmlr-noisecorrected/

Abstract

The Langevin algorithm is a classic method for sampling from a given pdf in a real space. In its basic version, it only requires knowledge of the gradient of the log-density, also called the score function. However, in deep learning, it is often easier to learn the so-called "noisy-data score function", i.e. the gradient of the log-density of noisy data, more precisely when Gaussian noise is added to the data. Such an estimate is biased and complicates the use of the Langevin method. Here, we propose a noise-corrected version of the Langevin algorithm, where the bias due to noisy data is removed, at least regarding first-order terms. Unlike diffusion models, our algorithm only needs to know the noisy-data score function for one single noise level. We further propose a simple special case which has an interesting intuitive interpretation of iteratively adding noise the data and then attempting to remove half of that noise.

PDF TMLR Semantic Scholar

Cite

Text

Hyvarinen. "A Noise-Corrected Langevin Algorithm and Sampling by Half-Denoising." Transactions on Machine Learning Research, 2025.

Markdown

[Hyvarinen. "A Noise-Corrected Langevin Algorithm and Sampling by Half-Denoising." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/hyvarinen2025tmlr-noisecorrected/)

BibTeX

@article{hyvarinen2025tmlr-noisecorrected,
  title     = {{A Noise-Corrected Langevin Algorithm and Sampling by Half-Denoising}},
  author    = {Hyvarinen, Aapo},
  journal   = {Transactions on Machine Learning Research},
  year      = {2025},
  url       = {https://mlanthology.org/tmlr/2025/hyvarinen2025tmlr-noisecorrected/}
}