On the Analysis of GAN-Based Image-to-Image Translation with Gaussian Noise Injection

Abstract

Image-to-image (I2I) translation is vital in computer vision tasks like style transfer and domain adaptation. While recent advances in GAN have enabled high-quality sample generation, real-world challenges such as noise and distortion remain significant obstacles. Although Gaussian noise injection during training has been utilized, its theoretical underpinnings have been unclear. This work provides a robust theoretical framework elucidating the role of Gaussian noise injection in I2I translation models. We address critical questions on the influence of noise variance on distribution divergence, resilience to unseen noise types, and optimal noise intensity selection. Our contributions include connecting $f$-divergence and score matching, unveiling insights into the impact of Gaussian noise on aligning probability distributions, and demonstrating generalized robustness implications. We also explore choosing an optimal training noise level for consistent performance in noisy environments. Extensive experiments validate our theoretical findings, showing substantial improvements over various I2I baseline models in noisy settings. Our research rigorously grounds Gaussian noise injection for I2I translation, offering a sophisticated theoretical understanding beyond heuristic applications.

Cite

Text

Shi et al. "On the Analysis of GAN-Based Image-to-Image Translation with Gaussian Noise Injection." International Conference on Learning Representations, 2024.

Markdown

[Shi et al. "On the Analysis of GAN-Based Image-to-Image Translation with Gaussian Noise Injection." International Conference on Learning Representations, 2024.](https://mlanthology.org/iclr/2024/shi2024iclr-analysis/)

BibTeX

@inproceedings{shi2024iclr-analysis,
  title     = {{On the Analysis of GAN-Based Image-to-Image Translation with Gaussian Noise Injection}},
  author    = {Shi, Chaohua and Huang, Kexin and Gan, Lu and Liu, Hongqing and Zhu, Mingrui and Wang, Nannan and Gao, Xinbo},
  booktitle = {International Conference on Learning Representations},
  year      = {2024},
  url       = {https://mlanthology.org/iclr/2024/shi2024iclr-analysis/}
}