TraVeLGAN: Image-to-Image Translation by Transformation Vector Learning
Abstract
Interest in image-to-image translation has grown substantially in recent years with the success of unsupervised models based on the cycle-consistency assumption. The achievements of these models have been limited to a particular subset of domains where this assumption yields good results, namely homogeneous domains that are characterized by style or texture differences. We tackle the challenging problem of image-to-image translation where the domains are defined by high-level shapes and contexts, as well as including significant clutter and heterogeneity. For this purpose, we introduce a novel GAN based on preserving intra-domain vector transformations in a latent space learned by a siamese network. The traditional GAN system introduced a discriminator network to guide the generator into generating images in the target domain. To this two-network system we add a third: a siamese network that guides the generator so that each original image shares semantics with its generated version. With this new three-network system, we no longer need to constrain the generators with the ubiquitous cycle-consistency restraint or any other autoencoding regularization. As a result, the generators can learn mappings between more complex domains that differ from each other by more than just style or texture. We demonstrate our model by mapping between high-resolution, arbitrarily chosen classes from the Imagenet dataset completely without pre-processing such as cropping, centering, or filtering unrepresentative images.
Cite
Text
Amodio and Krishnaswamy. "TraVeLGAN: Image-to-Image Translation by Transformation Vector Learning." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00919Markdown
[Amodio and Krishnaswamy. "TraVeLGAN: Image-to-Image Translation by Transformation Vector Learning." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/amodio2019cvpr-travelgan/) doi:10.1109/CVPR.2019.00919BibTeX
@inproceedings{amodio2019cvpr-travelgan,
title = {{TraVeLGAN: Image-to-Image Translation by Transformation Vector Learning}},
author = {Amodio, Matthew and Krishnaswamy, Smita},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2019},
doi = {10.1109/CVPR.2019.00919},
url = {https://mlanthology.org/cvpr/2019/amodio2019cvpr-travelgan/}
}