Deep Learning's Shallow Gains: A Comparative Evaluation of Algorithms for Automatic Music Generation
Abstract
Deep learning methods are recognised as state-of-the-art for many applications of machine learning. Recently, deep learning methods have emerged as a solution to the task of automatic music generation (AMG) using symbolic tokens in a target style, but their superiority over non-deep learning methods has not been demonstrated. Here, we conduct a listening study to comparatively evaluate several music generation systems along six musical dimensions: stylistic success, aesthetic pleasure, repetition or self-reference, melody, harmony, and rhythm. A range of models, both deep learning algorithms and other methods, are used to generate 30-s excerpts in the style of Classical string quartets and classical piano improvisations. Fifty participants with relatively high musical knowledge rate unlabelled samples of computer-generated and human-composed excerpts for the six musical dimensions. We use non-parametric Bayesian hypothesis testing to interpret the results, allowing the possibility of finding meaningful non -differences between systems’ performance. We find that the strongest deep learning method, a reimplemented version of Music Transformer, has equivalent performance to a non-deep learning method, MAIA Markov, demonstrating that to date, deep learning does not outperform other methods for AMG. We also find there still remains a significant gap between any algorithmic method and human-composed excerpts.
Cite
Text
Yin et al. "Deep Learning's Shallow Gains: A Comparative Evaluation of Algorithms for Automatic Music Generation." Machine Learning, 2023. doi:10.1007/S10994-023-06309-WMarkdown
[Yin et al. "Deep Learning's Shallow Gains: A Comparative Evaluation of Algorithms for Automatic Music Generation." Machine Learning, 2023.](https://mlanthology.org/mlj/2023/yin2023mlj-deep/) doi:10.1007/S10994-023-06309-WBibTeX
@article{yin2023mlj-deep,
title = {{Deep Learning's Shallow Gains: A Comparative Evaluation of Algorithms for Automatic Music Generation}},
author = {Yin, Zongyu and Reuben, Federico and Stepney, Susan and Collins, Tom},
journal = {Machine Learning},
year = {2023},
pages = {1785-1822},
doi = {10.1007/S10994-023-06309-W},
volume = {112},
url = {https://mlanthology.org/mlj/2023/yin2023mlj-deep/}
}