Bayesian Nonparametric Matrix Factorization for Recorded Music

Abstract

Recent research in machine learning has focused on breaking audio spectrograms into separate sources of sound using latent variable decompositions. These methods require that the number of sources be specified in advance, which is not always possible. To address this problem, we develop Gamma Process Nonnegative Matrix Factorization (GaP-NMF), a Bayesian nonparametric approach to decomposing spectrograms. The assumptions behind GaP-NMF are based on research in signal processing regarding the expected distributions of spectrogram data, and GaP-NMF automatically discovers the number of latent sources. We derive a mean-field variational inference algorithm and evaluate GaP-NMF on both synthetic data and recorded music.

Cite

Text

Hoffman et al. "Bayesian Nonparametric Matrix Factorization for Recorded Music." International Conference on Machine Learning, 2010.

Markdown

[Hoffman et al. "Bayesian Nonparametric Matrix Factorization for Recorded Music." International Conference on Machine Learning, 2010.](https://mlanthology.org/icml/2010/hoffman2010icml-bayesian/)

BibTeX

@inproceedings{hoffman2010icml-bayesian,
  title     = {{Bayesian Nonparametric Matrix Factorization for Recorded Music}},
  author    = {Hoffman, Matthew D. and Blei, David M. and Cook, Perry R.},
  booktitle = {International Conference on Machine Learning},
  year      = {2010},
  pages     = {439-446},
  url       = {https://mlanthology.org/icml/2010/hoffman2010icml-bayesian/}
}