Deep Appearance Maps

Abstract

We propose a deep representation of appearance, i.e. the relation of color, surface orientation, viewer position, material and illumination. Previous approaches have used deep learning to extract classic appearance representations relating to reflectance model parameters (e.g. Phong) or illumination (e.g. HDR environment maps). We suggest to directly represent appearance itself as a network we call a deep appearance map (DAM). This is a 4D generalization over 2D reflectance maps, which held the view direction fixed. First, we show how a DAM can be learned from images or video frames and later be used to synthesize appearance, given new surface orientations and viewer positions. Second, we demonstrate how another network can be used to map from an image or video frames to a DAM network to reproduce this appearance, without using a lengthy optimization such as stochastic gradient descent (learning-to-learn). Finally, we show the example of an appearance estimation-and-segmentation task, mapping from an image showing multiple materials to multiple deep appearance maps.

Cite

Text

Maximov et al. "Deep Appearance Maps." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019. doi:10.1109/ICCV.2019.00882

Markdown

[Maximov et al. "Deep Appearance Maps." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.](https://mlanthology.org/iccv/2019/maximov2019iccv-deep/) doi:10.1109/ICCV.2019.00882

BibTeX

@inproceedings{maximov2019iccv-deep,
  title     = {{Deep Appearance Maps}},
  author    = {Maximov, Maxim and Leal-Taixe, Laura and Fritz, Mario and Ritschel, Tobias},
  booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year      = {2019},
  doi       = {10.1109/ICCV.2019.00882},
  url       = {https://mlanthology.org/iccv/2019/maximov2019iccv-deep/}
}