The Sound of an Album Cover: A Probabilistic Approach to Multimedia

Abstract

We present a novel, flexible, statistical approach to modeling music, images and text jointly. The technique is based on multi-modal mixture models and efficient computation using online EM. The learned models can be used to browse multimedia databases, to query on a multimedia database using any combination of music, images and text (lyrics and other contextual information), to annotate documents with music and images, and to find documents in a database similar to input text, music and/or graphics files.

Cite

Text

Brochu et al. "The Sound of an Album Cover: A Probabilistic Approach to Multimedia." Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, 2003.

Markdown

[Brochu et al. "The Sound of an Album Cover: A Probabilistic Approach to Multimedia." Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, 2003.](https://mlanthology.org/aistats/2003/brochu2003aistats-sound/)

BibTeX

@inproceedings{brochu2003aistats-sound,
  title     = {{The Sound of an Album Cover: A Probabilistic Approach to Multimedia}},
  author    = {Brochu, Eric and de Freitas, Nando and Bao, Kejie},
  booktitle = {Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics},
  year      = {2003},
  pages     = {49--56},
  volume    = {R4},
  url       = {https://mlanthology.org/aistats/2003/brochu2003aistats-sound/}
}