Controllable Visual-Tactile Synthesis

Abstract

Deep generative models have various content creation applications such as graphic design, e-commerce, and virtual try-on. However, current works mainly focus on synthesizing realistic visual outputs, often ignoring other sensory modalities, such as touch, which limits physical interaction with users. In this work, we leverage deep generative models to create a multi-sensory experience where users can touch and see the synthesized object when sliding their fingers on a haptic surface. The main challenges lie in the significant scale discrepancy between vision and touch sensing and the lack of explicit mapping from touch sensing data to a haptic rendering device. To bridge this gap, we collect high-resolution tactile data with a GelSight sensor and create a new visuotactile clothing dataset. We then develop a conditional generative model that synthesizes both visual and tactile outputs from a single sketch. We evaluate our method regarding image quality and tactile rendering accuracy. Finally, we introduce a pipeline to render high-quality visual and tactile outputs on an electroadhesion-based haptic device for an immersive experience, allowing for challenging materials and editable sketch inputs.
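To make the core idea concrete, generating both a visual image and a tactile map from a single input sketch can be pictured as a conditional generator with a shared encoder and two decoder heads. The snippet below is only a minimal, hypothetical PyTorch sketch of that idea: the class name, channel sizes, and layer layout are illustrative assumptions and not the authors' model, which additionally addresses the vision-touch scale gap and uses adversarial training.

# Minimal, hypothetical sketch of a sketch-conditioned generator with a shared
# encoder and two decoder heads: one visual (RGB) output and one tactile output
# (e.g., a surface height map). Illustrative only; not the paper's architecture.
import torch
import torch.nn as nn

class SketchToVisualTactile(nn.Module):
    def __init__(self, base_channels=64):
        super().__init__()
        # Shared encoder over the 1-channel input sketch.
        self.encoder = nn.Sequential(
            nn.Conv2d(1, base_channels, 4, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(base_channels, base_channels * 2, 4, stride=2, padding=1),
            nn.ReLU(inplace=True),
        )
        # Visual head: 3-channel RGB image.
        self.visual_head = nn.Sequential(
            nn.ConvTranspose2d(base_channels * 2, base_channels, 4, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.ConvTranspose2d(base_channels, 3, 4, stride=2, padding=1),
            nn.Tanh(),
        )
        # Tactile head: 1-channel height map (in practice the tactile signal is
        # far finer-grained than the visual image, hence the scale discrepancy).
        self.tactile_head = nn.Sequential(
            nn.ConvTranspose2d(base_channels * 2, base_channels, 4, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.ConvTranspose2d(base_channels, 1, 4, stride=2, padding=1),
            nn.Tanh(),
        )

    def forward(self, sketch):
        feat = self.encoder(sketch)
        return self.visual_head(feat), self.tactile_head(feat)

# Usage: a 256x256 sketch yields an RGB image and a tactile map.
model = SketchToVisualTactile()
rgb, tactile = model(torch.randn(1, 1, 256, 256))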

Cite

Text

Gao et al. "Controllable Visual-Tactile Synthesis." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.00648

Markdown

[Gao et al. "Controllable Visual-Tactile Synthesis." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/gao2023iccv-controllable/) doi:10.1109/ICCV51070.2023.00648

BibTeX

@inproceedings{gao2023iccv-controllable,
  title     = {{Controllable Visual-Tactile Synthesis}},
  author    = {Gao, Ruihan and Yuan, Wenzhen and Zhu, Jun-Yan},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {7040--7052},
  doi       = {10.1109/ICCV51070.2023.00648},
  url       = {https://mlanthology.org/iccv/2023/gao2023iccv-controllable/}
}