Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-on

Abstract

Diffusion-based virtual try-on aims to synthesize a realistic image that seamlessly integrates a specific garment onto a target model. The primary challenge lies in effectively guiding the warping process of the latent diffusion model. However, previous methods either lack direct guidance or explicitly warp the garment image, which makes them highly dependent on the performance of the warping module. In this paper, we propose FIA-VTON, which leverages the implicit flow feature as guidance by adopting a Flow Infused Attention module for virtual try-on. The dense warp flow map is projected as indirect guidance that implicitly enhances feature map warping during generation, which is less sensitive to the accuracy of the warping estimate than an explicit warp of the garment image. To further enhance the implicit warp guidance, we incorporate high-level spatial attention to complement the dense warp. Experimental results on the VTON-HD and DressCode datasets show that FIA-VTON significantly outperforms state-of-the-art methods, demonstrating that it is effective and robust for virtual try-on.
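
The idea of projecting a dense flow map into attention as implicit guidance, rather than warping garment pixels directly, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say how the flow is projected or where it enters the attention, so the function name `flow_infused_attention`, the linear projection `w_flow`, and the choice to add the flow embedding to the queries are all assumptions made for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def flow_infused_attention(person_feat, garment_feat, flow, w_flow):
    """Hypothetical cross-attention with a projected flow map as guidance.

    person_feat:  (N, C) flattened person feature map, N = H * W
    garment_feat: (M, C) flattened garment feature map
    flow:         (N, 2) dense warp flow, one (dx, dy) offset per location
    w_flow:       (2, C) assumed linear projection of the flow into feature space
    """
    # Project the 2-channel flow into the feature dimension (assumption).
    flow_embed = flow @ w_flow                      # (N, C)
    # Infuse the flow into the queries instead of warping the garment image.
    queries = person_feat + flow_embed              # (N, C)
    scale = np.sqrt(person_feat.shape[1])
    weights = softmax(queries @ garment_feat.T / scale, axis=-1)  # (N, M)
    # Each person location gathers garment features softly.
    return weights @ garment_feat                   # (N, C)


# Toy inputs: a 4x4 feature map with 8 channels.
rng = np.random.default_rng(0)
H, W, C = 4, 4, 8
person = rng.standard_normal((H * W, C))
garment = rng.standard_normal((H * W, C))
flow = rng.standard_normal((H * W, 2))
w_flow = 0.1 * rng.standard_normal((2, C))

out = flow_infused_attention(person, garment, flow, w_flow)
print(out.shape)  # (16, 8)
```

In this sketch an inaccurate flow only biases the attention weights, and the garment features are still aggregated softly, which is one way to read the claim that implicit guidance is less sensitive to warping errors than an explicit pixel warp.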

Cite

Text

Zhang et al. "Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-on." International Conference on Computer Vision, 2025.

Markdown

[Zhang et al. "Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-on." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/zhang2025iccv-learning-a/)

BibTeX

@inproceedings{zhang2025iccv-learning-a,
  title     = {{Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-on}},
  author    = {Zhang, Delong and Huang, Qiwei and Sun, Yang and Liu, Yuanliu and Zheng, Wei-Shi and Xiong, Pengfei and Zhang, Wei},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {18736--18745},
  url       = {https://mlanthology.org/iccv/2025/zhang2025iccv-learning-a/}
}