Paint-It: Text-to-Texture Synthesis via Deep Convolutional Texture mAP Optimization and Physically-Based Rendering

Abstract

We present Paint-it a text-driven high-fidelity texture map synthesis method for 3D meshes via neural re-parameterized texture optimization. Paint-it synthesizes texture maps from a text description by synthesis-through-optimization exploiting the Score-Distillation Sampling (SDS). We observe that directly applying SDS yields undesirable texture quality due to its noisy gradients. We reveal the importance of texture parameterization when using SDS. Specifically we propose Deep Convolutional Physically-Based Rendering (DC-PBR) parameterization which re-parameterizes the physically-based rendering (PBR) texture maps with randomly initialized convolution-based neural kernels instead of a standard pixel-based parameterization. We show that DC-PBR inherently schedules the optimization curriculum according to texture frequency and naturally filters out the noisy signals from SDS. In experiments Paint-it obtains remarkable quality PBR texture maps within 15 min. given only a text description. We demonstrate the generalizability and practicality of Paint-it by synthesizing high-quality texture maps for large-scale mesh datasets and showing test-time applications such as relighting and material control using a popular graphics engine.

Cite

Text

Youwang et al. "Paint-It: Text-to-Texture Synthesis via Deep Convolutional Texture mAP Optimization and Physically-Based Rendering." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00416

Markdown

[Youwang et al. "Paint-It: Text-to-Texture Synthesis via Deep Convolutional Texture mAP Optimization and Physically-Based Rendering." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/youwang2024cvpr-paintit/) doi:10.1109/CVPR52733.2024.00416

BibTeX

@inproceedings{youwang2024cvpr-paintit,
  title     = {{Paint-It: Text-to-Texture Synthesis via Deep Convolutional Texture mAP Optimization and Physically-Based Rendering}},
  author    = {Youwang, Kim and Oh, Tae-Hyun and Pons-Moll, Gerard},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {4347-4356},
  doi       = {10.1109/CVPR52733.2024.00416},
  url       = {https://mlanthology.org/cvpr/2024/youwang2024cvpr-paintit/}
}