Visual Learning by Integrating Descriptive and Generative Methods
Abstract
This paper presents a mathematical framework for visual learning that integrates two popular statistical learning paradigms in the literature: (I). Descriptive learning, such as Markov random fields and minimax entropy learning, and (II). Generative learning, such as PCA, ICA, TCA, image coding and HMM. We apply this integrated learning framework to texton modeling, and we assume that an observed texture image is generated by multiple layers of hidden stochastic "texton processes" with each texton being a window function, like a mini-template or a wavelet, under affine transformations. The spatial arrangements of the textons are characterized by minimax entropy models. The texton processes generate images by occlusion or linear addition. Thus given a raw input image, the learning framework achieves four goals: (i). Computing the appearance of the textons. (ii) Inferring the hidden stochastic texton processes. (iii). Learning Gibbs models for each texton process and (iv). Verifying the learnt textons and Gibbs models through random sampling and texture synthesis. The integrated framework subsumes the minimax entropy learning paradigm and creates a richer class of probability models for visual patterns, which are suited for middle level vision representations. Furthermore we show that the integration of description and generative methods yields a natural and general framework of visual learning. We demonstrate the proposed framework and algorithms on many real images.
Cite
Text
Guo et al. "Visual Learning by Integrating Descriptive and Generative Methods." IEEE/CVF International Conference on Computer Vision, 2001. doi:10.1109/ICCV.2001.10107Markdown
[Guo et al. "Visual Learning by Integrating Descriptive and Generative Methods." IEEE/CVF International Conference on Computer Vision, 2001.](https://mlanthology.org/iccv/2001/guo2001iccv-visual/) doi:10.1109/ICCV.2001.10107BibTeX
@inproceedings{guo2001iccv-visual,
title = {{Visual Learning by Integrating Descriptive and Generative Methods}},
author = {Guo, Cheng-en and Zhu, Song Chun and Wu, Ying Nian},
booktitle = {IEEE/CVF International Conference on Computer Vision},
year = {2001},
pages = {370-377},
doi = {10.1109/ICCV.2001.10107},
url = {https://mlanthology.org/iccv/2001/guo2001iccv-visual/}
}