Conditional Hand Image Generation Using Latent Space Supervision in Random Variable Variational Autoencoders

Abstract

We introduce a novel framework for generating photorealistic synthetic images of human hands conditioned to a precise pose annotation. We propose a supervised Random Variable Variational Autoencoder (SRV-VAE), a model that disentangles and encodes the appearance and pose of the hand into separate components of the latent space. Appearance, representing individual subject traits, is unsupervised. Hand pose is strictly supervised and yields control over the synthesis process. Leveraging the robust RV VAE variant, our architecture ensures stable training and accurate encoding of complex hand dynamics. Our model is capable of generating hand images of previously unseen hand poses for specific subjects. Experimental results indicate the model’s efficacy in synthesizing realistic and varied hand images, holding significant promise for advancements in both academic research and practical applications such as data upsampling, where accurate hand pose and texture data is critical.

Cite

Text

Nicodemou et al. "Conditional Hand Image Generation Using Latent Space Supervision in Random Variable Variational Autoencoders." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-91578-9_5

Markdown

[Nicodemou et al. "Conditional Hand Image Generation Using Latent Space Supervision in Random Variable Variational Autoencoders." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/nicodemou2024eccvw-conditional/) doi:10.1007/978-3-031-91578-9_5

BibTeX

@inproceedings{nicodemou2024eccvw-conditional,
  title     = {{Conditional Hand Image Generation Using Latent Space Supervision in Random Variable Variational Autoencoders}},
  author    = {Nicodemou, Vassilis C. and Oikonomidis, Iason and Karvounas, Giorgos and Argyros, Antonis A.},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {85-100},
  doi       = {10.1007/978-3-031-91578-9_5},
  url       = {https://mlanthology.org/eccvw/2024/nicodemou2024eccvw-conditional/}
}