The Sound of an Album Cover: A Probabilistic Approach to Multimedia
Abstract
We present a novel, flexible, statistical approach to modeling music, images and text jointly. The technique is based on multi-modal mixture models and efficient computation using online EM. The learned models can be used to browse multimedia databases, to query on a multimedia database using any combination of music, images and text (lyrics and other contextual information), to annotate documents with music and images, and to find documents in a database similar to input text, music and/or graphics files.
Cite
Text
Brochu et al. "The Sound of an Album Cover: A Probabilistic Approach to Multimedia." Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, 2003.
Markdown
[Brochu et al. "The Sound of an Album Cover: A Probabilistic Approach to Multimedia." Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, 2003.](https://mlanthology.org/aistats/2003/brochu2003aistats-sound/)
BibTeX
@inproceedings{brochu2003aistats-sound,
title = {{The Sound of an Album Cover: A Probabilistic Approach to Multimedia}},
author = {Brochu, Eric and de Freitas, Nando and Bao, Kejie},
booktitle = {Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics},
year = {2003},
pages = {49--56},
volume = {R4},
url = {https://mlanthology.org/aistats/2003/brochu2003aistats-sound/}
}