Latent Diffusion-Enhanced Virtual Try-on via Optimized Pseudo-Label Generation

Abstract

Efficiently applying fully supervised learning to virtual try-on tasks is challenging due to the lack of paired ground truth in available training samples. Recent works have achieved virtual try-ons by employing self-supervised learning-based inpainting paradigms. However, this approach is heavily dependent on the constraints of inpainting masks. An incorrect mask can mislead the generated results, while overly large mask areas can lose essential original information, thereby hindering the synthesis of high-quality results. To address these problems, we propose a latent diffusion model-based virtual try-on network that achieves fully supervised learning using the concept of cycle consistency and knowledge distillation. Specifically, we divide our approach into pretext and downstream tasks. In the pretext task, we generate a pseudo-label (pseudo-person image) to form paired training samples, which enables the downstream task to achieve fully supervised learning. To prevent the unreliable pseudo-person image from introducing irresponsible prior knowledge, we propose a noise-covering strategy, which aims at fully optimizing the pseudo-label to eliminate the impact of the incorrect inpainting mask as much as possible. Additionally, we propose a skin refinement loss to further enhance the generation of details in the skin region. Extended experiments demonstrate that our proposed method is superior to state-of-the-art methods.

Cite

Text

Du et al. "Latent Diffusion-Enhanced Virtual Try-on via Optimized Pseudo-Label Generation." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I3.32283

Markdown

[Du et al. "Latent Diffusion-Enhanced Virtual Try-on via Optimized Pseudo-Label Generation." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/du2025aaai-latent/) doi:10.1609/AAAI.V39I3.32283

BibTeX

@inproceedings{du2025aaai-latent,
  title     = {{Latent Diffusion-Enhanced Virtual Try-on via Optimized Pseudo-Label Generation}},
  author    = {Du, Chenghu and Wang, Junyin and Yu, Feng and Xiong, Shengwu},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {2780-2788},
  doi       = {10.1609/AAAI.V39I3.32283},
  url       = {https://mlanthology.org/aaai/2025/du2025aaai-latent/}
}