Learning Compositional Sparse Models of Bimodal Percepts

Suren Kumar, Vikas Dhiman, Jason J. Corso

AAAI 2014 pp. 366-372

doi:10.1609/AAAI.V28I1.8753 /aaai/2014/kumar2014aaai-learning/

Abstract

Various perceptual domains have underlying compositional semantics that are rarely captured in current models. We suspect this is because directly learning the compositional structure has evaded these models. Yet, the compositional structure of a given domain can be grounded in a separate domain thereby simplifying its learning. To that end, we propose a new approach to modeling bimodal percepts that explicitly relates distinct projections across each modality and then jointly learns a bimodal sparse representation. The resulting model enables compositionality across these distinct projections and hence can generalize to unobserved percepts spanned by this compositional basis. For example, our model can be trained on 'red triangles' and 'blue squares'; yet, implicitly will also have learned 'red squares' and 'blue triangles'. The structure of the projections and hence the compositional basis is learned automatically for a given language model. To test our model, we have acquired a new bimodal dataset comprising images and spoken utterances of colored shapes in a tabletop setup. Our experiments demonstrate the benefits of explicitly leveraging compositionality in both quantitative and human evaluation studies.

PDF AAAI Semantic Scholar

Cite

Text

Kumar et al. "Learning Compositional Sparse Models of Bimodal Percepts." AAAI Conference on Artificial Intelligence, 2014. doi:10.1609/AAAI.V28I1.8753

Markdown

[Kumar et al. "Learning Compositional Sparse Models of Bimodal Percepts." AAAI Conference on Artificial Intelligence, 2014.](https://mlanthology.org/aaai/2014/kumar2014aaai-learning/) doi:10.1609/AAAI.V28I1.8753

BibTeX

@inproceedings{kumar2014aaai-learning,
  title     = {{Learning Compositional Sparse Models of Bimodal Percepts}},
  author    = {Kumar, Suren and Dhiman, Vikas and Corso, Jason J.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2014},
  pages     = {366-372},
  doi       = {10.1609/AAAI.V28I1.8753},
  url       = {https://mlanthology.org/aaai/2014/kumar2014aaai-learning/}
}