Control Structures for Incorporating Picture-Specific Context in Image Interpretation

Abstract

This paper describes an efficient control mechanism for incorporating picture-specific context in the task of image interpretation. Although other knowledge-based vision systems use general domain context in reducing the computational burden of image interpretation, to our knowledge, this is the first effort in exploring picture-specific collateral information. We assume that constraints on the picture are generated from a natural language understanding module which processes descriptive text accompanying the pictures. We have developed a unified framework for exploiting these constraints both in the object location and identification (labeling) stage. In particular, we describe a technique for incorporating constrained search in context-based vision. Finally, we demonstrate the effectiveness of this approach in PICTION, a system that uses captions to label human faces in newspaper photographs. 1 Introduction To solve the inherently under-constrained task of image ...

Cite

Text

Chopra and Srihari. "Control Structures for Incorporating Picture-Specific Context in Image Interpretation." International Joint Conference on Artificial Intelligence, 1995.

Markdown

[Chopra and Srihari. "Control Structures for Incorporating Picture-Specific Context in Image Interpretation." International Joint Conference on Artificial Intelligence, 1995.](https://mlanthology.org/ijcai/1995/chopra1995ijcai-control/)

BibTeX

@inproceedings{chopra1995ijcai-control,
  title     = {{Control Structures for Incorporating Picture-Specific Context in Image Interpretation}},
  author    = {Chopra, Rajiv and Srihari, Rohini K.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {1995},
  pages     = {50-55},
  url       = {https://mlanthology.org/ijcai/1995/chopra1995ijcai-control/}
}