Single Stage Virtual Try-on via Deformable Attention Flows

Abstract

Virtual try-on aims to generate a photo-realistic fitting result given an in-shop garment and a reference person image. Existing methods usually build up multi-stage frameworks to deal with clothes warping and body blending respectively, or rely heavily on intermediate parser-based labels which may be noisy or even inaccurate. To solve the above challenges, we propose a single-stage try-on framework by developing a novel Deformable Attention Flow (DAFlow), which applies the deformable attention scheme to multi-flow estimation. With pose keypoints as the guidance only, the self- and cross-deformable attention flows are estimated for the reference person and the garment images, respectively. By sampling multiple flow fields, the feature-level and pixel-level information from different semantic areas is simultaneously extracted and merged through the attention mechanism. It enables clothes warping and body synthesizing at the same time which leads to photo-realistic results in an end-to-end manner. Extensive experiments on two try-on datasets demonstrate that our proposed method achieves state-of-the-art performance both qualitatively and quantitatively. Furthermore, additional experiments on the other two image editing tasks illustrate the versatility of our method for multi-view synthesis and image animation. Code will be made available at https://github.com/OFA-Sys/DAFlow.

Cite

Text

Bai et al. "Single Stage Virtual Try-on via Deformable Attention Flows." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19784-0_24

Markdown

[Bai et al. "Single Stage Virtual Try-on via Deformable Attention Flows." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/bai2022eccv-single/) doi:10.1007/978-3-031-19784-0_24

BibTeX

@inproceedings{bai2022eccv-single,
  title     = {{Single Stage Virtual Try-on via Deformable Attention Flows}},
  author    = {Bai, Shuai and Zhou, Huiling and Li, Zhikang and Zhou, Chang and Yang, Hongxia},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19784-0_24},
  url       = {https://mlanthology.org/eccv/2022/bai2022eccv-single/}
}