Deep Learning's Shallow Gains: A Comparative Evaluation of Algorithms for Automatic Music Generation

Abstract

Deep learning methods are recognised as state-of-the-art for many applications of machine learning. Recently, deep learning methods have emerged as a solution to the task of automatic music generation (AMG) using symbolic tokens in a target style, but their superiority over non-deep learning methods has not been demonstrated. Here, we conduct a listening study to comparatively evaluate several music generation systems along six musical dimensions: stylistic success, aesthetic pleasure, repetition or self-reference, melody, harmony, and rhythm. A range of models, both deep learning algorithms and other methods, are used to generate 30-s excerpts in the style of Classical string quartets and classical piano improvisations. Fifty participants with relatively high musical knowledge rate unlabelled samples of computer-generated and human-composed excerpts for the six musical dimensions. We use non-parametric Bayesian hypothesis testing to interpret the results, allowing the possibility of finding meaningful non -differences between systems’ performance. We find that the strongest deep learning method, a reimplemented version of Music Transformer, has equivalent performance to a non-deep learning method, MAIA Markov, demonstrating that to date, deep learning does not outperform other methods for AMG. We also find there still remains a significant gap between any algorithmic method and human-composed excerpts.

Cite

Text

Yin et al. "Deep Learning's Shallow Gains: A Comparative Evaluation of Algorithms for Automatic Music Generation." Machine Learning, 2023. doi:10.1007/S10994-023-06309-W

Markdown

[Yin et al. "Deep Learning's Shallow Gains: A Comparative Evaluation of Algorithms for Automatic Music Generation." Machine Learning, 2023.](https://mlanthology.org/mlj/2023/yin2023mlj-deep/) doi:10.1007/S10994-023-06309-W

BibTeX

@article{yin2023mlj-deep,
  title     = {{Deep Learning's Shallow Gains: A Comparative Evaluation of Algorithms for Automatic Music Generation}},
  author    = {Yin, Zongyu and Reuben, Federico and Stepney, Susan and Collins, Tom},
  journal   = {Machine Learning},
  year      = {2023},
  pages     = {1785-1822},
  doi       = {10.1007/S10994-023-06309-W},
  volume    = {112},
  url       = {https://mlanthology.org/mlj/2023/yin2023mlj-deep/}
}