A Sketch-Transformer Network for Face Photo-Sketch Synthesis

Abstract

We present a face photo-sketch synthesis model, which converts a face photo into an artistic face sketch or recover a photo-realistic facial image from a sketch portrait. Recent progress has been made by convolutional neural networks (CNNs) and generative adversarial networks (GANs), so that promising results can be obtained through real-time end-to-end architectures. However, convolutional architectures tend to focus on local information and neglect long-range spatial dependency, which limits the ability of existing approaches in keeping global structural information. In this paper, we propose a Sketch-Transformer network for face photo-sketch synthesis, which consists of three closely-related modules, including a multi-scale feature and position encoder for patch-level feature and position embedding, a self-attention module for capturing long-range spatial dependency, and a multi-scale spatially-adaptive de-normalization decoder for image reconstruction. Such a design enables the model to generate reasonable detail texture while maintaining global structural information. Extensive experiments show that the proposed method achieves significant improvements over state-of-the-art approaches on both quantitative and qualitative evaluations.

Cite

Text

Zhu et al. "A Sketch-Transformer Network for Face Photo-Sketch Synthesis." International Joint Conference on Artificial Intelligence, 2021. doi:10.24963/IJCAI.2021/187

Markdown

[Zhu et al. "A Sketch-Transformer Network for Face Photo-Sketch Synthesis." International Joint Conference on Artificial Intelligence, 2021.](https://mlanthology.org/ijcai/2021/zhu2021ijcai-sketch/) doi:10.24963/IJCAI.2021/187

BibTeX

@inproceedings{zhu2021ijcai-sketch,
  title     = {{A Sketch-Transformer Network for Face Photo-Sketch Synthesis}},
  author    = {Zhu, Mingrui and Liang, Changcheng and Wang, Nannan and Wang, Xiaoyu and Li, Zhifeng and Gao, Xinbo},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {1352-1358},
  doi       = {10.24963/IJCAI.2021/187},
  url       = {https://mlanthology.org/ijcai/2021/zhu2021ijcai-sketch/}
}