Syntactic Image Parsing Using Ontology and Semantic Descriptions

Abstract

We present an ontology-guided, symbol-based image parser that uses semantic, spoken-language descriptions of entities in images as well as the real-world spatial relationships defined between these entities. Our parsing approach explicitly describes objects and the relationships between them with linguistically meaningful modes of colors, textures, and coarse expressions of shapes. The image parser is built on a syntactic, image-grammar-based framework and performs a (near) global optimization using superpixels as an initial set of subpatterns. It hypothesizes the entities in images using their local semantic attributes and verifies them globally using their more global features and their relative spatial locations. Evaluations of the parser are performed on selected images, which we make publicly available along with their manual segmentations and our labeling results.
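The hypothesize-then-verify idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the `Superpixel` fields, the `ONTOLOGY` and `ABOVE` tables, and the scoring rule are all hypothetical, and an exhaustive search stands in for the paper's (near) global optimization. Each superpixel first receives candidate labels from its local color and texture terms, and the candidate labeling that best agrees with the spatial relations is then kept.

```python
from dataclasses import dataclass
from itertools import product


@dataclass
class Superpixel:
    id: int
    color: str         # linguistic color term, e.g. "blue"
    texture: str       # linguistic texture term, e.g. "smooth"
    centroid_y: float  # vertical position in [0, 1]; 0 is the top of the image


# Hypothetical toy ontology: admissible local attributes per entity.
ONTOLOGY = {
    "sky": {"colors": {"blue", "white"}, "textures": {"smooth"}},
    "grass": {"colors": {"green"}, "textures": {"rough"}},
    "water": {"colors": {"blue"}, "textures": {"smooth", "rippled"}},
}

# Hypothetical spatial relations: (upper, lower) means upper appears above lower.
ABOVE = {("sky", "grass"), ("sky", "water")}


def hypothesize(sp):
    """Local step: entities whose attributes match this superpixel."""
    return [
        name
        for name, attrs in ONTOLOGY.items()
        if sp.color in attrs["colors"] and sp.texture in attrs["textures"]
    ]


def spatial_score(labels, superpixels):
    """Global step: +1 per satisfied 'above' relation, -1 per violated one."""
    score = 0
    for a, a_label in zip(superpixels, labels):
        for b, b_label in zip(superpixels, labels):
            if (a_label, b_label) in ABOVE:
                score += 1 if a.centroid_y < b.centroid_y else -1
    return score


def parse(superpixels):
    """Brute-force search over candidate labelings (toy sizes only)."""
    candidates = [hypothesize(sp) or ["unknown"] for sp in superpixels]
    best = max(product(*candidates), key=lambda ls: spatial_score(ls, superpixels))
    return {sp.id: label for sp, label in zip(superpixels, best)}


superpixels = [
    Superpixel(0, "blue", "smooth", 0.1),   # locally ambiguous: sky or water
    Superpixel(1, "green", "rough", 0.6),
    Superpixel(2, "blue", "smooth", 0.9),   # locally ambiguous: sky or water
]
result = parse(superpixels)
print(result)  # {0: 'sky', 1: 'grass', 2: 'water'}
```

In this toy case the two blue, smooth regions are indistinguishable locally; only the spatial relations separate the upper one (sky) from the lower one (water), which is the role the abstract assigns to global verification.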

Cite

Text

Nwogu et al. "Syntactic Image Parsing Using Ontology and Semantic Descriptions." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2010. doi:10.1109/CVPRW.2010.5543723

Markdown

[Nwogu et al. "Syntactic Image Parsing Using Ontology and Semantic Descriptions." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2010.](https://mlanthology.org/cvprw/2010/nwogu2010cvprw-syntactic/) doi:10.1109/CVPRW.2010.5543723

BibTeX

@inproceedings{nwogu2010cvprw-syntactic,
  title     = {{Syntactic Image Parsing Using Ontology and Semantic Descriptions}},
  author    = {Nwogu, Ifeoma and Govindaraju, Venu and Brown, Christopher},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2010},
  pages     = {41-48},
  doi       = {10.1109/CVPRW.2010.5543723},
  url       = {https://mlanthology.org/cvprw/2010/nwogu2010cvprw-syntactic/}
}