Article

MRBERT: Pre-Training of Melody and Rhythm for Automatic Music Generation

1 Department of Multimedia Engineering, Graduate School, Dongguk University–Seoul, Seoul 04620, Republic of Korea
2 Department of Multimedia Engineering, Dongguk University–Seoul, Seoul 04620, Republic of Korea
* Author to whom correspondence should be addressed.
Mathematics 2023, 11(4), 798; https://doi.org/10.3390/math11040798
Submission received: 26 December 2022 / Revised: 11 January 2023 / Accepted: 1 February 2023 / Published: 4 February 2023
(This article belongs to the Special Issue Advanced Artificial Intelligence Models and Its Applications)

Abstract

Deep learning technology has been extensively studied for its potential in music, notably for creative music generation research. Traditional music generation approaches based on recurrent neural networks cannot provide satisfactory long-distance dependencies. These approaches are typically designed for specific tasks, such as melody and chord generation, and cannot generate diverse music simultaneously. In natural language processing, pre-training is used to accomplish various tasks and to overcome the limitation of long-distance dependencies. However, pre-training is not yet widely used in automatic music generation. Because of the differences in the attributes of language and music, traditional pre-trained models utilized in language modeling cannot be directly applied to music. This paper proposes a pre-trained model, MRBERT, for multitask-based music generation that learns melody and rhythm representations. After fine-tuning, the pre-trained model can be applied to music generation applications such as web-based music composers that include melody and rhythm generation, modification, completion, and chord matching. The results of ablation experiments performed on the proposed model revealed that, under the HITS@k evaluation metric, the pre-trained MRBERT considerably improved the performance of the generation tasks by 0.09–13.10% and 0.02–7.37% compared to RNNs and the original BERT, respectively.

1. Introduction

In the past decade, artificial intelligence has made breakthroughs due to the introduction of deep learning, which allows the use of various artificial intelligence models in different fields. Representation learning has been in the spotlight because it significantly reduces the amount of data required to train a model through semi-supervised and self-supervised learning, and, more importantly, it overcomes the limitations of traditional supervised learning that requires annotated training data. Representation learning has achieved excellent results in computer vision [1], natural language processing [2], and music generation [3,4].
Deep learning-based music technology has been extensively studied for its potential in music. This includes music generation [3,4], music classification [5,6], melody recognition [7,8], and music evaluation [9,10]. These functions rely on learning and summarizing knowledge from music corpora rather than obtaining it from music theory. Among them, music generation research is notable because it involves performing a creative task. Music generation tasks fall into three categories, namely autoregressive [11], conditional [12], and sequence-to-sequence (Seq2Seq) generation [13]. In autoregressive generation, the current value is predicted based on the information from previous values. For music, each predicted note becomes a consideration when predicting the following notes, and a piece of music can be generated by looping this process. In conditional generation, contextual information is used to predict missing values. When predicting missing values at random positions of a piece of music, contextual information from both the left and right directions should be considered; thus, music completion can be realized. In Seq2Seq generation, a novel sequence is generated based on a given sequence. Seq2Seq generation involves two processes: understanding the given sequence and then generating a new sequence using the understood content. Seq2Seq generation can be applied in music to generate matching chords based on a given melody.
The above-mentioned traditional music generation models are typically designed to accomplish only one of the aforementioned three categories and cannot be generalized to other tasks. Inspired by natural language modeling, music generation requires a model that can be applied to multiple tasks without requiring large training resources [2]. Bidirectional encoder representations from transformers (BERT) [14] is a language representation model that is used to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right contextual information in all layers. The pre-trained model can be fine-tuned with only an additional output layer to create state-of-the-art models for numerous tasks without substantial task-specific architecture modifications. Therefore, this paper focuses on the application of representation models in music generation.
Compared to traditional music generation models, pre-trained model-based automatic music generation models exhibit several advantages. First, pre-trained models can learn better representations of music than traditional music generation models. Traditional music generation models utilize PianoRoll [15] as the representation, which is similar to one-hot encoding. Therefore, PianoRoll exhibits the same sparse-matrix problem as one-hot encoding, and contextual information is ignored. In contrast, music in the pre-trained model is mapped into an n-dimensional space, which is a non-sparse representation that considers contextual information from both directions [14]. Second, pre-trained models can handle long-distance dependencies. Traditional models [16,17,18] of music generation typically utilize recurrent neural networks (RNNs) and their variants, such as long short-term memory (LSTM) and the gated recurrent unit (GRU), to generate music because of their ability to memorize temporal information. However, RNNs exhibit vanishing gradients caused by backpropagation through time (BPTT) and cannot handle long-distance dependencies. Although LSTM and GRU alleviate the long-distance dependency problem by adding memory cells and gates, their effect is limited because of BPTT [19]. BERT, based on the multihead attention mechanism, can link long-distance notes and consider global features [20]. Finally, pre-trained models can process data in parallel, whereas RNN-like models run recurrently, which not only causes vanishing gradients but also wastes computing resources. Because the transformers in BERT run in parallel, all tokens in the sequence are embedded without waiting for the data of the previous time step to be processed [20]. However, applying traditional natural language pre-trained models directly to music representation learning does not provide the desired results, because there is no concept of rhythm in natural language, whereas rhythm is as important as melody in music. Therefore, an approach for learning musical representations that takes both the melody and rhythm into account is needed for music generation.
In this paper, a modification of BERT, namely MRBERT, is proposed for pre-training on the melody and rhythm and fine-tuning for music generation. In pre-training, the melody and rhythm are embedded separately. To exchange information between the melody and rhythm, semi-cross attention is used instead of merging them as in traditional methods, which prevents feature loss. In fine-tuning, the following three generation tasks are designed: autoregressive, conditional, and Seq2Seq. Thus, the pre-trained model is fine-tuned with output layers corresponding to the three types of generation tasks to realize multitask music generation.
The contributions of this paper are as follows: (1) A novel generative pre-trained model based on melody and rhythm, namely MRBERT, is proposed for multitask music generation, including autoregressive and conditional generation, as well as Seq2Seq generation. (2) In pre-training for representation learning, the melody and rhythm are considered separately, based on the assumption that each has strong dependencies on itself and weak dependencies on the other. Experimental results show that this assumption is reasonable and can be widely applied in related research. (3) The proposed MRBERT with three generation tasks allows users to interactively generate melodies and rhythms from scratch, modify or complete existing melodies and rhythms, or generate matching chords for existing melodies and rhythms.

2. Related Work

This section first describes BERT [14], a well-known representation learning model, and then introduces two BERT-based music representation learning studies, MusicBERT [21] and MidiBERT [22].
BERT is a language representation model that is designed to learn deep bidirectional representations from unlabeled text. It did this by conditioning on both the left and right context in all layers of the model. BERT is able to achieve state-of-the-art results on a wide range of natural language processing tasks, including question answering and language inference, by being fine-tuned with only one additional output layer. It has been shown to perform particularly well on a number of benchmarks, including the GLUE benchmark, the MultiNLI dataset, and the SQuAD question answering dataset. The main contribution of BERT is that it proves the importance of bidirectional pre-training for representation learning. Unlike previous language modeling approaches that used a unidirectional language model for pre-training [2] and used a shallow concatenation of independently trained left-to-right and right-to-left language modeling (LM) [23], BERT used a masked language model (MLM) to enable pre-trained deep bidirectional representations.
Due to BERT’s success in natural language processing tasks, researchers have started to apply representation learning to music data. Two representative studies in this area are MusicBERT and MidiBERT.
MusicBERT is a large-scale pre-trained model for music understanding that is trained on a large symbolic music corpus containing more than 1 million pieces of music. MusicBERT introduced several mechanisms, including OctupleMIDI encoding and a bar-level masking strategy, to enhance pre-training on symbolic music data. Furthermore, four music understanding tasks were designed: two generation tasks, melody completion and accompaniment suggestion, and two classification tasks, genre and style classification.
MidiBERT used a smaller corpus than MusicBERT and focused on piano music. For the token representation, it used the beat-based revamped MIDI-derived events [24] and borrowed the compound words [25] representation to reduce the length of the token sequences. Furthermore, MidiBERT established a benchmark for symbolic music understanding that includes not only note-level tasks (melody extraction and velocity prediction) but also sequence-level tasks (composer classification and emotion classification).
Unlike these two studies, the proposed MRBERT is a pre-trained model that can be used for music generation tasks. The MRBERT uses OpenEWLD [26], a leadsheet-based music corpus that contains the information necessary for music generation, such as the melody, rhythm, and chords. The MRBERT differs from other models in that the melody and rhythm are divided into separate token sequences. Additionally, the embedding layer of the traditional BERT and the attention layer in its transformer are modified to better fit the pre-training of the melody and rhythm. Finally, unlike the prediction and classification tasks of traditional methods, three generation tasks are used to evaluate the performance of the pre-trained model for music generation.

3. Automatic Music Generation Based on MRBERT

In this paper, the MRBERT is proposed to learn representations of the melody and rhythm for automatic music generation. First, the token representation is described. Then, the structure and pre-training of the MRBERT are explained, and, finally, the fine-tuning strategies are described.

3.1. Token Representation

The melody, rhythm, and chords are extracted from the OpenEWLD [26] music corpus for pre-training and fine-tuning. The OpenEWLD music corpus consists of songs in leadsheet form, as displayed in Figure 1A. In Figure 1B, the leadsheet is converted from MusicXML to events through the Python library music21. Figure 1C reveals that the events include Instruments, Keys, Timesignatures, Measures, ChordSymbols, and Notes, from which only the information related to the melody, rhythm, and chords is extracted. For example, “G4(2/4)” indicates that the pitch of the note is G in the fourth octave and that the duration of the note is 2/4. The next step is to separate the melody and rhythm sequences, as displayed in Figure 1D. The chord sequences are extracted from ChordSymbols to prepare for the Seq2Seq generation task in fine-tuning, as presented in Figure 1E. For example, “C” represents a chord that continues with the melody until the next chord occurs.
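As a concrete illustration, the following is a minimal sketch of this extraction step using music21. The file name is a placeholder, and the convention of expressing durations as fractions of a whole note (so that a half note becomes 2/4, as in the “G4(2/4)” example) is an assumption inferred from Figure 1.

```python
from fractions import Fraction
from music21 import converter, harmony, note

# Parse one leadsheet (file name is a placeholder) and collect the melody,
# rhythm, and chord event sequences sketched in Figure 1C-E.
score = converter.parse("leadsheet.xml")

melody, rhythm, chords = [], [], []
for element in score.flatten().getElementsByClass([harmony.ChordSymbol, note.Note, note.Rest]):
    if isinstance(element, harmony.ChordSymbol):
        chords.append(element.figure)                  # e.g. "C", "F", "C7"
    elif isinstance(element, note.Note):
        melody.append(element.pitch.nameWithOctave)    # e.g. "G4"
        rhythm.append(Fraction(element.duration.quarterLength) / 4)  # fraction of a whole note
    else:                                              # note.Rest
        melody.append("Rest")
        rhythm.append(Fraction(element.duration.quarterLength) / 4)
```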

3.2. Pre-Training of MRBERT

The MRBERT is a pre-trained model for learning representations of the melody and rhythm. As displayed in Figure 2, the melody sequence $(m_1, m_2, \_\_, \dots, m_n)$ and rhythm sequence $(r_1, r_2, \_\_, \dots, r_n)$ are input to the embedding layers, where “$\_\_$” represents a randomly masked token. The tokens of the melody and rhythm sequences are embedded by the corresponding token embedding layers. The position embedding layer, which is shared by the melody and rhythm, adds the position feature to them. Through the embedding layers, the melody embedding $e_M$ and the rhythm embedding $e_R$ are obtained. Next, $e_M$ and $e_R$ are input to the corresponding transformers, which exchange information through semi-cross attention. Semi-cross attention is proposed to realize the information exchange between the melody and rhythm. As presented in Formula (1), the cross query of $e_M$ is obtained as the dot product of the melody query $q_M$ with the rhythm query $q_R$ activated by softmax. The key $k_M$ and value $v_M$ are used as in self-attention. For the rhythm, the melody query $q_M$ is required to calculate the cross query of $e_R$. Finally, the melody hidden states $h_M$ and rhythm hidden states $h_R$ output by the transformers are passed through the melody prediction layer and rhythm prediction layer to predict the masked melody and rhythm tokens.
$$\mathrm{SemiCrossAttention}_M = \mathrm{softmax}\!\left(\frac{\left(q_M \cdot \mathrm{softmax}(q_R)\right) k_M^{\mathsf{T}}}{\sqrt{d_k}}\right) v_M \quad \text{and} \quad \mathrm{SemiCrossAttention}_R = \mathrm{softmax}\!\left(\frac{\left(q_R \cdot \mathrm{softmax}(q_M)\right) k_R^{\mathsf{T}}}{\sqrt{d_k}}\right) v_R \tag{1}$$
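The sketch below illustrates one way Formula (1) could be implemented for a single stream in PyTorch. It is a simplified, single-module reading of the paper: the element-wise interpretation of the product between $q_M$ and $\mathrm{softmax}(q_R)$, the shared projection weights, and the class and argument names are assumptions, not the authors' implementation.

```python
import math
import torch
import torch.nn as nn

class SemiCrossAttention(nn.Module):
    """Sketch of Formula (1): the query of one stream (e.g., the melody) is gated by
    the softmax-activated query of the other stream (the rhythm) before standard
    scaled dot-product attention over the stream's own keys and values."""

    def __init__(self, hidden_size: int = 768, num_heads: int = 12):
        super().__init__()
        self.num_heads, self.d_k = num_heads, hidden_size // num_heads
        self.q_proj = nn.Linear(hidden_size, hidden_size)
        self.k_proj = nn.Linear(hidden_size, hidden_size)
        self.v_proj = nn.Linear(hidden_size, hidden_size)
        self.out_proj = nn.Linear(hidden_size, hidden_size)

    def _split(self, x):
        b, t, _ = x.shape  # -> (batch, heads, time, d_k)
        return x.view(b, t, self.num_heads, self.d_k).transpose(1, 2)

    def forward(self, e_self, e_other):
        # e_self: embeddings of this stream (e.g., melody); e_other: the other stream (rhythm).
        q_self = self._split(self.q_proj(e_self))
        q_other = self._split(self.q_proj(e_other))  # in the full model this would come from the other transformer
        k, v = self._split(self.k_proj(e_self)), self._split(self.v_proj(e_self))
        cross_q = q_self * torch.softmax(q_other, dim=-1)              # cross query
        scores = cross_q @ k.transpose(-2, -1) / math.sqrt(self.d_k)   # scaled attention scores
        out = torch.softmax(scores, dim=-1) @ v
        b, _, t, _ = out.shape
        return self.out_proj(out.transpose(1, 2).reshape(b, t, -1))
```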
The pre-training strategy of this paper follows the MLM proposed in BERT: 15% of the tokens in the sequence are randomly selected, and of these, (1) 80% are replaced by MASK; (2) 10% are replaced by randomly selected tokens; and (3) the remaining 10% are left unchanged. Furthermore, to enhance pre-training performance, this paper follows BERT-like models and other related studies by dropping the next sentence prediction pre-training task and using dynamic masking [27].
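A minimal sketch of this masking procedure is shown below. The MASK token id, the ignore index, and the function signature are placeholders; regenerating the masks on every pass over the data is what makes the masking dynamic.

```python
import random

MASK_ID = 0          # placeholder id of the MASK token
IGNORE_INDEX = -100  # label value ignored by the cross-entropy loss

def dynamic_mask(token_ids, vocab_size, mask_prob=0.15):
    """Select 15% of the positions; replace 80% of them with MASK, 10% with a
    random token, and leave 10% unchanged. Called anew each epoch (dynamic masking)."""
    inputs, labels = list(token_ids), [IGNORE_INDEX] * len(token_ids)
    for i, tok in enumerate(token_ids):
        if random.random() >= mask_prob:
            continue
        labels[i] = tok
        r = random.random()
        if r < 0.8:
            inputs[i] = MASK_ID
        elif r < 0.9:
            inputs[i] = random.randrange(vocab_size)
        # else: keep the original token
    return inputs, labels
```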

3.3. Fine-Tuning of Generation Tasks

To address the diverse generation tasks, the MRBERT is fine-tuned with three downstream tasks, namely autoregressive, conditional, and Seq2Seq generation. Furthermore, after fine-tuning for each task, joint generation can be achieved by executing the three generation methods simultaneously.

3.3.1. Autoregressive Generation Task

To accomplish the autoregressive generation task, its generation pattern should be known; it can be summarized as a unidirectional generation similar to a Markov chain [28], $P(t_i \mid t_1, t_2, t_3, \dots, t_{i-1})$, where the probability of the token $t_i$ depends on $t_1$ to $t_{i-1}$. In autoregressive generation, the tokens are predicted in order from left to right, and the current token is predicted based on the previous tokens. First, <BOS> (the beginning of the sequence, a special token in the vocabulary) is passed into the MRBERT. Next, the output layers, which are a pair of fully connected layers, predict the melody and rhythm based on the hidden states from the MRBERT. Finally, the predicted melody and rhythm are used to calculate the cross-entropy loss for backpropagation. When backpropagation ends, the input token sequences are incremented by one time step, and the model predicts the melody and rhythm of the next time step until <EOS> (the end of the sequence, a special token corresponding to <BOS>) is generated. The ground-truth label data are easily obtained by shifting the input sequences to the right by one time step. The pre-trained model and output layers continuously shorten the gap between the predictions and the label data through fine-tuning. After fine-tuning, whenever a melody and rhythm token pair is generated, it is added to the end of the sequence to form the new input, as displayed in Figure 3.
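The generation loop after fine-tuning could look like the sketch below. The model call signature (returning melody and rhythm hidden states), the two output heads, and the vocab object are assumptions used for illustration; greedy argmax decoding is shown, although candidates can also be drawn from the predicted distribution.

```python
import torch

@torch.no_grad()
def generate_autoregressive(model, melody_head, rhythm_head, vocab, max_len=64):
    """Start from <BOS>, predict the next melody/rhythm pair from the last hidden
    state, append it to the input, and repeat until <EOS> (see Figure 3)."""
    melody, rhythm = [vocab.bos_id], [vocab.bos_id]
    for _ in range(max_len):
        h_m, h_r = model(torch.tensor([melody]), torch.tensor([rhythm]))
        next_m = melody_head(h_m[:, -1]).argmax(-1).item()
        next_r = rhythm_head(h_r[:, -1]).argmax(-1).item()
        melody.append(next_m)
        rhythm.append(next_r)
        if next_m == vocab.eos_id:
            break
    return melody, rhythm
```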

3.3.2. Conditional Generation Task

Unlike in autoregressive generation, in conditional generation not only previous tokens but also future tokens are considered when predicting unknown tokens; the model should consider the bidirectional contextual information around the unknown tokens. To realize this task, a generation pattern such as that of a denoising autoencoder [29] is used, $P(t_j \mid t_1, t_2, \dots, t_{j-1}, t_{j+1}, \dots, t_i)$, where the unknown token $t_j$ is predicted based on the known tokens. Fine-tuning for conditional generation is highly similar to pre-training. However, because multiple tokens are masked in pre-training, each masked token is predicted as if it were independent of the other masked tokens. To address this problem, shorter sequences are used and only one pair of melody and rhythm tokens is masked in fine-tuning. The cross-entropy loss is calculated from the predictions (melody or rhythm) and the ground-truth labels and is then used for fine-tuning. After fine-tuning, the MRBERT and the output layer of the conditional generation fill in the missing parts according to the contextual information obtained from the given melody and rhythm, as displayed in Figure 4.
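A sketch of how the fine-tuned model could fill a single missing position is given below; again, the model call signature, output heads, and vocab object are assumed placeholders.

```python
import torch

@torch.no_grad()
def fill_missing(model, melody_head, rhythm_head, melody_ids, rhythm_ids, pos, vocab):
    """Mask one melody/rhythm position and predict it from both left and right
    context (see Figure 4)."""
    melody_ids, rhythm_ids = list(melody_ids), list(rhythm_ids)
    melody_ids[pos] = vocab.mask_id
    rhythm_ids[pos] = vocab.mask_id
    h_m, h_r = model(torch.tensor([melody_ids]), torch.tensor([rhythm_ids]))
    pred_m = melody_head(h_m[:, pos]).argmax(-1).item()
    pred_r = rhythm_head(h_r[:, pos]).argmax(-1).item()
    return pred_m, pred_r
```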

3.3.3. Seq2Seq Generation Task

When the melody and rhythm have been created, chords should be added to make them sound less monotonous. This generation pattern can be summarized as $P(t'_1, t'_2, \dots, t'_i \mid t_1, t_2, \dots, t_i)$, where $t$ represents the given tokens and $t'$ represents the tokens to be predicted; the probability of $t'$ at positions 1 to $i$ is conditioned on the given $t$ at positions 1 to $i$. In fine-tuning, the melody and rhythm sequences are input into the MRBERT, and the chords at the corresponding positions are predicted by the output layer of the Seq2Seq generation. The cross-entropy loss calculated from the predicted chords and the ground-truth label data is used for fine-tuning. After fine-tuning, the MRBERT can accept a melody and rhythm and subsequently generate chords through the output layer of the Seq2Seq generation, as displayed in Figure 5. The continuous output of the same chord symbol indicates that the chord is held until a different symbol appears.
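The sketch below shows one way the per-position chord prediction could be wired up. Concatenating the melody and rhythm hidden states before the chord head, as well as the call signature and the chord_vocab object, are assumptions for illustration.

```python
import torch

@torch.no_grad()
def generate_chords(model, chord_head, melody_ids, rhythm_ids, chord_vocab):
    """Predict one chord symbol per position from the melody/rhythm hidden states;
    repeated symbols mean the same chord continues (see Figure 5)."""
    h_m, h_r = model(torch.tensor([melody_ids]), torch.tensor([rhythm_ids]))
    logits = chord_head(torch.cat([h_m, h_r], dim=-1))  # (1, seq_len, chord_vocab_size)
    return [chord_vocab.id_to_symbol(i) for i in logits.argmax(-1).squeeze(0).tolist()]
```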

3.3.4. Joint Generation

Users can use the MRBERT with three generation tasks interactively, as displayed in Figure 6. A simulated use case reveals how the three generation approaches operate simultaneously. First, the melody and rhythm can be generated under the autoregressive generation task. Next, the user can adjust the tokens in the generated melody and rhythm through conditional generation. Finally, the chords are matched to the generated melody and rhythm through the Seq2Seq generation task.
For the predictions provided under the aforementioned three tasks, in addition to the prediction with the highest probability, other candidates and their corresponding probabilities are also given because, in music, a fixed answer rarely exists. Although the highest-probability prediction is the most reasonable choice according to what the model has learned from the music corpus, it may not be the most appropriate one. Users can choose the candidate they consider most suitable.
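Candidate lists of this kind (as in Tables 2 and 4) can be produced by ranking the softmax output of the task's output layer; the sketch below assumes a one-dimensional logits vector and a hypothetical vocab object.

```python
import torch

def top_k_candidates(logits, vocab, k=5):
    """Return the k most probable tokens and their probabilities so that the
    user can pick an alternative to the top prediction."""
    probs = torch.softmax(logits, dim=-1)
    values, indices = probs.topk(k)
    return [(vocab.id_to_token(i), round(p, 3)) for i, p in zip(indices.tolist(), values.tolist())]
```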

4. Experiments

The MRBERT was first trained to convergence through the pre-training task MLM. Next, ablation experiments were conducted on three generation tasks based on the pre-trained MRBERT. BERT, which is a traditional language pre-trained model, was used as the baseline for the ablation experiments.

4.1. Dataset

The EWLD (Enhanced Wikifonia Leadsheet Dataset) is a dataset of music leadsheets containing various metadata about composers, works, lyrics, and features. It is designed specifically for musicological and research purposes. OpenEWLD [26] is a dataset extracted from EWLD, containing only public domain leadsheets, which is used as the dataset for training in this paper. As shown in Figure 1, each leadsheet contains the melody, rhythm, and chords required for training. A total of 502 leadsheets from different composers are included in OpenEWLD, and 90% of these were selected for training, with the remaining 10% used for evaluation.

4.2. Experimental Environment

The ablation experiment includes w/o cross-attn. (BERT + separate embedding), which uses separate embeddings and the original self-attention instead of semi-cross attention, and w/o separate embed. (BERT), in which the melody and rhythm share a common embedding layer and only self-attention is used (w/o means “without”). Furthermore, experimental results for RNNs (and BiRNNs) without any pre-training are also listed to show the effect of pre-training. HITS@k [21] (k = 1, 3, 5, and 10), which measures the proportion of cases in which the correct answer is included in the top k candidates, was used as the evaluation metric. HITS@k is calculated as shown in Formula (2), where $n$ represents the number of samples and $I(\cdot)$ is an indicator function that returns 1 if the rank of the correct answer is not greater than k, and 0 otherwise.
$$\mathrm{HITS@}k = \frac{1}{n} \sum_{i=1}^{n} I(\mathrm{rank}_i \le k) \tag{2}$$
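As a quick illustration of Formula (2), a direct implementation over a list of ranks (rank 1 being the top candidate) could be:

```python
def hits_at_k(ranks, k):
    """Fraction of samples whose correct answer is ranked within the top k candidates."""
    return sum(1 for r in ranks if r <= k) / len(ranks)

# Example with the ranks of the correct token for five predictions.
print(hits_at_k([1, 4, 2, 7, 3], k=3))  # 0.6
```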
Table 1 presents the hyperparameters of the MRBERT (and the ablation models) in pre-training and fine-tuning. During pre-training, most of the hyperparameters were set to the same values as those in RoBERTa-base [27], with slight differences in the Number of Layers, Learning Rate Decay, Batch Size, Max Steps, and Warmup Steps. The Number of Layers in the MRBERT was set to 6 × 2 because it has two sets of transformer blocks corresponding to the melody and rhythm separately, while ensuring that the number of parameters in the model is on the same level as in the ablation experiments. For the Learning Rate Decay, power decay was used rather than linear decay to make the change in the learning rate smoother and more conducive to convergence. The Batch Size, Max Steps, and Warmup Steps were adjusted according to the music corpus used.
In fine-tuning, the Melody Vocab Size, Rhythm Vocab Size, and Chord Vocab Size determine the dimension of the probability distribution given by the output layer. The melody and rhythm have 72 and 21 candidates, respectively, which contain four special tokens (<BOS>, <EOS>, <UNK>, <PAD>). In the ablation experiment of w/o separate embed., since the melody and rhythm share an embedding layer, the number of candidates is 89. Furthermore, the number of chord candidates reached 799.
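These vocabulary sizes translate directly into the width of the fine-tuning output layers; a minimal sketch (assuming simple linear heads on the 768-dimensional hidden states from Table 1) is:

```python
import torch.nn as nn

# Output layers implied by the vocabulary sizes above; the use of plain linear
# heads is an assumption about the output-layer structure.
melody_head = nn.Linear(768, 72)   # 68 pitches + 4 special tokens
rhythm_head = nn.Linear(768, 21)   # 17 durations + 4 special tokens
chord_head = nn.Linear(768, 799)   # 795 chords + 4 special tokens
```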

4.3. Results of Autoregressive Generation

When evaluating autoregressive generation, the pre-trained MRBERT with the output layer of the autoregressive generation task predicts the next melody and rhythm at each time step based on the previous ones. Figure 7 displays the generated melody and rhythm.
Table 2 presents the generated melody and rhythm and the probabilities of the predictions at each time step. The top rhythm prediction occupies a much larger share of the probability mass, whereas the probabilities of the melody candidates are more balanced; that is, the model is more confident in the rhythm prediction. This result is consistent with the analysis of the music data: music typically has obvious rhythm patterns, whereas the progression of the melody is complex and changeable.
Table 3 presents the ablation experimental results of HITS@k in the autoregressive generation task. For the melody prediction, in HITS@k (k = 1, 3, 5, and 10), the MRBERT achieved an average of 51.70%, which is 2.77% higher than w/o cross-attn., 3.65% higher than w/o separate embed., and 7.94% higher than the RNN. For the rhythm prediction, it achieved an average of 81.79%, which is 0.37% higher than w/o cross-attn., 0.78% higher than w/o separate embed., and 2.56% higher than the RNN.
The experimental results revealed that the MRBERT outperformed the ablation models in all metrics, especially in the melody prediction. Since w/o cross-attn. utilized separate embeddings, its performance is slightly higher than that of w/o separate embed. Furthermore, pre-training considerably improved the prediction of the melody and rhythm.

4.4. Results of Conditional Generation

In the conditional generation, melody and rhythm sequences with tokens dropped at random positions were used as the evaluation data. The pre-trained MRBERT with the output layers of the conditional generation predicted the missing parts of the melody and rhythm based on the given melody and rhythm. Figure 8 displays the predictions of the model and the correct answers for missing parts at the head, middle, and tail of a piece of music. The leadsheet reveals that a missing part in the middle of a bar (or measure) could be easily predicted, but misjudgments occurred at positions where the bar switched.
Table 4 presents the details of the predictions in Figure 8. The model shows strong confidence in the rhythm prediction with high accuracy, whereas the probabilities of the melody candidates did not differ considerably. Although the model predicted G4 where the correct answer was F4, F4 appeared as the second candidate. Furthermore, the rhythm 1/8 was accurately predicted at this position, but the probability of the first candidate did not have an absolute advantage because the rhythm prediction fluctuates during the bar-switching stage, which is a normal phenomenon.
Table 5 presents the ablation experimental results of HITS@k in the conditional generation task. For the melody prediction, in HITS@k (k = 1, 3, 5, and 10), the MRBERT achieved an average of 54.86%, which is 1.49% higher than w/o cross-attn., 5.22% higher than w/o separate embed., and 9.95% higher than the BiRNN. For the rhythm prediction, it achieved an average of 81.85%, which is 0.55% higher than w/o cross-attn., 2.09% higher than w/o separate embed., and 3.16% higher than the BiRNN.
The experimental results revealed that the MRBERT outperformed the other ablation models, and the accuracy of the rhythm prediction was higher than that of the other models. Compared to the autoregressive generation, since information from two directions was considered in the conditional generation, the accuracy was slightly higher.

4.5. Results of Seq2Seq Generation

In the Seq2Seq generation, the melody with the chords was used as the evaluation data. Figure 9 shows an example of the real chords and predicted chords based on the pre-trained MRBERT with the output layer of the Seq2Seq generation. The predicted chords contained “F,” “BbM,” and “C7.” They were all included in the real chords.
Table 6 presents the ablation experimental results of HITS@k in the Seq2Seq generation task. The MRBERT achieved an average of 49.56%, which is 0.61% higher than w/o cross-attn., 1.83% higher than w/o separate embed., and 5.14% higher than the BiRNN.
The experimental results revealed that the MRBERT outperformed the other ablation models in the Seq2Seq generation task. Separate embedding also improved the performance even when predicting the chords rather than the melody and rhythm.

5. Discussion

This paper conducted ablation experiments for three kinds of tasks, autoregressive generation, conditional generation, and Seq2Seq generation, and evaluated them at multiple levels by setting different values of k in HITS@k. The experimental results demonstrate the following. First, pre-trained representation learning improves the performance of all three kinds of tasks; this is evident in the fact that the performance of the RNN and BiRNN is significantly lower than that of the models using pre-training in all tasks. Second, it is effective to consider the melody and rhythm separately in representation learning; the ablation results show that the model using separate embeddings performs better in HITS@k on each task than the model that does not. Third, the assumption that there are weak dependencies between the melody and rhythm is reasonable, because the performance of the MRBERT, which uses separate embeddings and semi-cross attention together, is slightly higher than that of the model using only separate embeddings.
This paper and other music representation learning studies are inspired by language modeling in natural language processing, so this method can only be applied to symbolic format music data. In fact, a large amount of music exists in audio format, such as mp3, wav, etc. This requires the model to be able to handle continuous spectrograms rather than discrete sequences. There have been some studies in computer vision that explore the application of representation learning in image processing [30,31,32], which is very enlightening for future work.

6. Conclusions

This paper proposed MRBERT, a pre-trained model for multitask music generation. During pre-training, the MRBERT learned representations of the melody and rhythm by dividing the embedding layers and transformer blocks into two groups and implementing information exchange through semi-cross attention. Compared to the original BERT, the MRBERT simultaneously considered the strong dependencies of the melodies and rhythms on themselves and the weak dependencies between them, which allows it to learn better representations than the original BERT. In the subsequent fine-tuning, the corresponding content was generated according to the task. Three music generation tasks, namely autoregressive, conditional, and Seq2Seq generation, were designed to help users compose music, making composition more convenient. Unlike traditional music generation approaches designed for a single task, these three tasks cover multiple functions: melody and rhythm generation, modification, and completion, as well as chord generation. To verify the performance of the MRBERT, ablation experiments were conducted on each generation task. The experimental results revealed that pre-training improves task performance and that the MRBERT, using separate embeddings and semi-cross attention, outperformed the traditional language pre-trained model BERT in the HITS@k metric.
The proposed method can be utilized in practical music generation applications, including melody and rhythm generation, modification, completion, and chord matching, such as web-based music composers. However, to generate high-quality music, a music corpus composed of leadsheets is used as the training data. These leadsheets must clearly label the melodies, rhythms, and corresponding chords. The problem is that it is difficult to collect this type of data, which limits the expansion of the data volume. In the future, although the application of pre-training techniques in music will continue to be explored, it is equally important to extend the generation tasks to unlabeled music symbolic data and audio data.

Author Contributions

Conceptualization, S.L. and Y.S.; methodology, S.L. and Y.S.; software, S.L. and Y.S.; validation, S.L. and Y.S.; writing—original draft preparation, S.L.; writing—review and editing, Y.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea (NRF-2021R1F1A1063466).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Restrictions apply to the availability of these data. Data were obtained from https://github.com/00sapo/OpenEWLD (accessed on 1 October 2022).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhang, W.; Wu, Q.J.; Yang, Y.; Akilan, T. Multimodel Feature Reinforcement Framework Using Moore–Penrose Inverse for Big Data Analysis. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 5008–5021. [Google Scholar] [CrossRef] [PubMed]
  2. Brown, T.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.D.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language Models are Few-Shot Learners. In Proceedings of the 34th Advances in Neural Information Processing Systems (NeurIPS), Online, 6–12 December 2020; pp. 1877–1901. [Google Scholar]
  3. Dong, H.W.; Hsiao, W.Y.; Yang, L.C.; Yang, Y.H. MuseGan: Multi-Track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; pp. 34–41. [Google Scholar]
  4. Li, S.; Jang, S.; Sung, Y. Automatic Melody Composition Using Enhanced GAN. Mathematics 2019, 7, 883. [Google Scholar] [CrossRef]
  5. Choi, K.; Fazekas, G.; Sandler, M.; Cho, K. Convolutional Recurrent Neural Networks for Music Classification. In Proceedings of the 2017 IEEE 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017; pp. 2392–2396. [Google Scholar]
  6. Qiu, L.; Li, S.; Sung, Y. DBTMPE: Deep Bidirectional Transformers-Based Masked Predictive Encoder Approach for Music Genre Classification. Mathematics 2021, 9, 530. [Google Scholar] [CrossRef]
  7. Park, H.; Yoo, C.D. Melody Extraction and Detection through LSTM-RNN with Harmonic Sum Loss. In Proceedings of the 2017 IEEE 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017; pp. 2766–2770. [Google Scholar]
  8. Li, S.; Jang, S.; Sung, Y. Melody Extraction and Encoding Method for Generating Healthcare Music Automatically. Electronics 2019, 8, 1250. [Google Scholar] [CrossRef]
  9. McLeod, A.; Steedman, M. Evaluating Automatic Polyphonic Music Transcription. In Proceedings of the 19th International Society for Music Information Retrieval Conference (ISMIR), Paris, France, 23–27 September 2018; pp. 42–49. [Google Scholar]
  10. Jiang, Z.; Li, S.; Sung, Y. Enhanced Evaluation Method of Musical Instrument Digital Interface Data based on Random Masking and Seq2Seq Model. Mathematics 2022, 10, 2747. [Google Scholar] [CrossRef]
  11. Wu, J.; Hu, C.; Wang, Y.; Hu, X.; Zhu, J. A Hierarchical Recurrent Neural Network for Symbolic Melody Generation. IEEE Trans. Cybern. 2019, 50, 2749–2757. [Google Scholar] [CrossRef] [PubMed]
  12. Li, S.; Jang, S.; Sung, Y. INCO-GAN: Variable-Length Music Generation Method Based on Inception Model-Based Conditional GAN. Mathematics 2021, 9, 387. [Google Scholar] [CrossRef]
  13. Makris, D.; Agres, K.R.; Herremans, D. Generating Lead Sheets with Affect: A Novel Conditional Seq2Seq Framework. In Proceedings of the 2021 International Joint Conference on Neural Networks (IJCNN), Shenzhen, China, 18–22 July 2021; pp. 1–8. [Google Scholar]
  14. Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. BERT: Pretraining of Deep Bidirectional Transformers for Language Understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar]
  15. Walder, C. Modelling Symbolic Music: Beyond the Piano Roll. In Proceedings of the 8th Asian Conference on Machine Learning (ACML), Hamilton, New Zealand, 16–18 November 2016; pp. 174–189. [Google Scholar]
  16. Hadjeres, G.; Pachet, F.; Nielsen, F. DeepBach: A Steerable Model for Bach Chorales Generation. In Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; pp. 1362–1371. [Google Scholar]
  17. Chu, H.; Urtasun, R.; Fidler, S. Song From PI: A Musically Plausible Network for Pop Music Generation. arXiv 2016, arXiv:1611.03477. [Google Scholar]
  18. Mogren, O. C-RNN-GAN: Continuous Recurrent Neural Networks with Adversarial Training. arXiv 2016, arXiv:1611.09904. [Google Scholar]
  19. Noh, S.H. Analysis of Gradient Vanishing of RNNs and Performance Comparison. Information 2021, 12, 442. [Google Scholar] [CrossRef]
  20. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention is All You Need. In Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
  21. Zeng, M.; Tan, X.; Wang, R.; Ju, Z.; Qin, T.; Liu, T.Y. MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training. In Proceedings of the Findings of the Associations for Computational Linguistics: ACL-IJCNLP, Online, 1–6 August 2021; pp. 791–800. [Google Scholar]
  22. Chou, Y.H.; Chen, I.; Chang, C.J.; Ching, J.; Yang, Y.H. MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding. arXiv 2021, arXiv:2107.05223. [Google Scholar]
  23. Peters, M.; Neumann, M.; Iyyer, M.; Gardner, M.; Clark, C.; Lee, K.; Zettlemoyer, L. Deep Contextualized Word Representations. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), Association for Computational Linguistics, New Orleans, LA, USA, 1–6 June 2018; pp. 2227–2237. [Google Scholar]
  24. Huang, Y.S.; Yang, Y.H. Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions. In Proceedings of the 28th ACM International Conference on Multimedia, Seattle, WA, USA, 12–16 October 2020; pp. 1180–1188. [Google Scholar]
  25. Hsiao, W.Y.; Liu, J.Y.; Yeh, Y.C.; Yang, Y.H. Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual, 2–9 February 2021; pp. 178–186. [Google Scholar]
  26. Simonetta, F.; Carnovalini, F.; Orio, N.; Rodà, A. Symbolic Music Similarity through a Graph-Based Representation. In Proceedings of the Audio Mostly on Sound in Immersion and Emotion, North Wales, UK, 12–14 September 2018; pp. 1–7. [Google Scholar]
  27. Liu, Y.; Ott, M.; Goyal, N.; Du, J.; Joshi, M.; Chen, D.; Levy, O.; Lewis, M.; Zettlemoyer, L.; Stoyanov, V. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv 2019, arXiv:1907.11692. [Google Scholar]
  28. Shapiro, I.; Huber, M. Markov Chains for Computer Music Generation. J. Humanist. Math. 2021, 11, 167–195. [Google Scholar] [CrossRef]
  29. Mittal, G.; Engel, J.; Hawthorne, C.; Simon, I. Symbolic Music Generation with Diffusion Models. arXiv 2021, arXiv:2103.16091. [Google Scholar]
  30. Zhang, W.; Wu, Q.J.; Zhao, W.W.; Deng, H.; Yang, Y. Hierarchical One-Class Model with Subnetwork for Representation Learning and Outlier Detection. IEEE Trans. Cybern. 2022, 1–14. [Google Scholar] [CrossRef] [PubMed]
  31. Zhang, W.; Yang, Y.; Wu, Q.J.; Wang, T.; Zhang, H. Multimodal Moore–Penrose Inverse-Based Recomputation Framework for Big Data Analysis. IEEE Trans. Neural Netw. Learn. Syst. 2022, 1–13. [Google Scholar] [CrossRef] [PubMed]
  32. Zhang, W.; Wu, Q.J.; Yang, Y. Semisupervised Manifold Regularization via a Subnetwork-Based Representation Learning Model. IEEE Trans. Cybern. 2022, 1–14. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Pipeline of token representation. (A) An example leadsheet in the music corpus; (B) The events converted from MusicXML; (C) Extracted events related to the melody, rhythm, and chords; (D) Generated melody sequence and rhythm sequence; (E) Generated chord sequence.
Figure 2. Pipeline of pre-training of MRBERT.
Figure 3. Pipeline of autoregressive generation. The orange arrows indicate that the predicted melody and rhythm are continuously added to the end of the input.
Figure 4. Pipeline of conditional generation. The underline represents the missing part of the music.
Figure 5. Pipeline of Seq2Seq generation. The melody and rhythm can be of any length, and the length of the generated chords varies accordingly.
Figure 6. Human–interactive use case of automatic music generation.
Figure 7. Leadsheets of the generated melody sequence.
Figure 8. Leadsheets of conditional generated results and reference.
Figure 9. Leadsheets of given melody sequence with generated chords and reference chords.
Table 1. Hyperparameters for pre-training and fine-tuning of MRBERT (with ablation models).
Parameters | MRBERT | w/o Cross-Attn. | w/o Separate Embed.
Number of Layers ¹ | 6 × 2 ³ | 12 | 12
Hidden size | 768 | 768 | 768
FFN inner hidden size | 3072 | 3072 | 3072
Attention heads | 12 | 12 | 12
Attention head size | 64 | 64 | 64
Dropout | 0.1 | 0.1 | 0.1
Batch Size | 32 | 32 | 32
Weight Decay | 0.01 | 0.01 | 0.01
Max Steps | 10 k | 10 k | 10 k
Warmup Steps | 1 k | 1 k | 1 k
Learning Rate Decay | power | power | power
Adam ϵ | 1 × 10−6 | 1 × 10−6 | 1 × 10−6
Adam β1 | 0.9 | 0.9 | 0.9
Adam β2 | 0.98 | 0.98 | 0.98
Melody Vocab Size ² | 68 + 4 = 72 ⁴ | 68 + 4 = 72 | -
Rhythm Vocab Size | 17 + 4 = 21 | 17 + 4 = 21 | -
Melody + Rhythm Vocab Size | - | - | 68 + 17 + 4 = 89
Chord Vocab Size | 795 + 4 = 799 | 795 + 4 = 799 | 795 + 4 = 799
¹ Hyperparameters for pre-training. ² Hyperparameters for fine-tuning. ³ 6 transformer layers for the melody and 6 transformer layers for the rhythm. ⁴ 4 is the number of special tokens: <BOS>, <EOS>, <UNK>, <PAD>.
Table 2. Details of autoregressive generation.
Time Step | Pitch | Probabilities of Melody | Rhythm | Probabilities of Rhythm
1 | <BOS> | | <BOS> |
2 | Rest | Rest: 0.100; G4: 0.098; F4: 0.092; D4: 0.089 | 1/4 | 1/4: 0.309; 1/8: 0.263; 1/2: 0.146; 1/16: 0.054
3 | A4 | A4: 0.111; G4: 0.104; D4: 0.095; Rest: 0.095 | 1/4 | 1/4: 0.519; 1/8: 0.205; 1/2: 0.114; 3/4: 0.046
4 | G4 | G4: 0.127; E4: 0.114; A4: 0.087; F4: 0.079 | 1/4 | 1/4: 0.501; 1/8: 0.202; 1/4: 0.104; 3/4: 0.054
5 | E4 | E4: 0.132; A4: 0.098; F4: 0.081; D4: 0.072 | 1/8 | 1/8: 0.364; 1/4: 0.364; 1/2: 0.097; 3/4: 0.070
6 | G4 | G4: 0.161; A4: 0.153; D4: 0.079; B4: 0.069 | 1/8 | 1/8: 0.427; 1/4: 0.356; 1/2: 0.073; 3/8: 0.042
7 | A4 | A4: 0.187; E4: 0.146; B4: 0.080; D4: 0.077 | 1/4 | 1/4: 0.423; 1/8: 0.398; 1/2: 0.065; 3/8: 0.037
8 | E4 | E4: 0.152; A4: 0.136; G4: 0.125; D4: 0.104 | 1/8 | 1/8: 0.465; 1/4: 0.308; 1/2: 0.076; 3/4: 0.049
9 | G4 | G4: 0.157; E4: 0.147; A4: 0.118; D4: 0.112 | 1/8 | 1/8: 0.412; 1/4: 0.313; 1/2: 0.072; 3/8: 0.061
10 | A4 | A4: 0.164; D4: 0.100; E4: 0.089; C5: 0.066 | 1/8 | 1/8: 0.355; 1/4: 0.344; 1/2: 0.110; 3/8: 0.056
11 | C5 | C5: 0.125; G4: 0.107; D4: 0.093; F4: 0.087 | 1/8 | 1/8: 0.385; 1/4: 0.370; 1/2: 0.112; 3/8: 0.038
12 | G4 | G4: 0.177; A4: 0.148; E4: 0.139; D4: 0.088 | 1/8 | 1/8: 0.569; 1/4: 0.267; 1/2: 0.056; 3/8: 0.045
13 | A4 | A4: 0.163; E4: 0.113; D4: 0.106; Rest: 0.086 | 1/8 | 1/8: 0.405; 1/4: 0.338; 1/2: 0.071; 3/8: 0.048
14 | E4 | E4: 0.131; A4: 0.108; F4: 0.085; D4: 0.074 | 1/4 | 1/4: 0.453; 1/8: 0.319; 1/2: 0.082; 3/8: 0.029
15 | F4 | F4: 0.148; A4: 0.102; G4: 0.090; C5: 0.086 | 1/8 | 1/8: 0.497; 1/4: 0.263; 1/2: 0.075; 3/4: 0.046
16 | G4 | G4: 0.212; A4: 0.142; E4: 0.116; D4: 0.088 | 1/8 | 1/8: 0.519; 1/4: 0.259; 1/2: 0.082; 3/8: 0.031
17 | A4 | A4: 0.156; E4: 0.116; D4: 0.088; F4: 0.076 | 1/8 | 1/8: 0.445; 1/4: 0.349; 1/2: 0.056; 3/8: 0.039
18 | F4 | F4: 0.144; E4: 0.104; G4: 0.087; C5: 0.079 | 1/8 | 1/8: 0.452; 1/4: 0.286; 1/2: 0.093; 3/8: 0.045
19 | G4 | G4: 0.148; A4: 0.134; E4: 0.103; D4: 0.099 | 1/8 | 1/8: 0.489; 1/4: 0.329; 1/2: 0.065; 3/8: 0.034
20 | E4 | E4: 0.139; A4: 0.120; C5: 0.093; F4: 0.077 | 1/8 | 1/8: 0.495; 1/4: 0.296; 1/2: 0.082; 3/8: 0.041
Table 3. Ablation experimental results of the autoregressive generation task.
Model | HITS@1 Mel. (%) | HITS@1 Rhy. (%) | HITS@3 Mel. (%) | HITS@3 Rhy. (%) | HITS@5 Mel. (%) | HITS@5 Rhy. (%) | HITS@10 Mel. (%) | HITS@10 Rhy. (%)
MRBERT | 15.87 | 51.53 | 42.03 | 83.01 | 61.53 | 92.81 | 87.36 | 99.81
w/o cross-attn. | 14.74 | 51.44 | 38.96 | 82.65 | 57.45 | 91.88 | 84.58 | 99.80
w/o separate embed. | 14.27 | 51.16 | 38.14 | 82.17 | 55.90 | 90.91 | 83.88 | 99.79
RNN | 12.51 | 48.24 | 33.60 | 79.28 | 50.28 | 89.67 | 78.63 | 99.72
Table 4. Details of conditional generation.
Masked Pitch Sequence | Probabilities of Pitch | Masked Rhythm Sequence | Probabilities of Rhythm
<BOS>, D4, E-4 ¹, F4, G4, … | E-4: 0.276; G4: 0.130; B-4: 0.118; A-4: 0.114; F4: 0.087; Rest: 0.069 | <BOS>, 1/6, 1/6, 1/6, 1/2, … | 1/6: 0.626; 3/16: 0.098; 1/4: 0.094; 1/2: 0.048
…, C5, B4, A4, G4, F#4, … | A4: 0.229; Rest: 0.164; G4: 0.160; C5: 0.141; B4: 0.096; D5: 0.033 | …, 1/8, 1/8, 1/8, 1/8, 1/8, … | 1/8: 0.785; 1/4: 0.109; 3/8: 0.040; 1/2: 0.038
…, G4, F4, F4, F4, <EOS> | G4: 0.280; F4 ²: 0.127; A4: 0.116; E4: 0.105; D4: 0.086; F#4: 0.083 | …, 3/8, 1/8, 1/2, 1/2, <EOS> | 1/8: 0.280; 1/2: 0.197; 1/4: 0.086; 3/8: 0.080
¹ The underline “__” indicates the covered pitch or rhythm. ² The model predicted G4, but the correct answer is F4.
Table 5. Ablation experimental results of the conditional generation task.
Model | HITS@1 Mel. (%) | HITS@1 Rhy. (%) | HITS@3 Mel. (%) | HITS@3 Rhy. (%) | HITS@5 Mel. (%) | HITS@5 Rhy. (%) | HITS@10 Mel. (%) | HITS@10 Rhy. (%)
MRBERT | 18.67 | 51.14 | 45.86 | 82.78 | 65.05 | 93.69 | 89.84 | 99.79
w/o cross-attn. | 18.07 | 50.93 | 43.94 | 82.02 | 63.35 | 92.55 | 88.10 | 99.69
w/o separate embed. | 15.69 | 48.61 | 40.27 | 80.11 | 57.68 | 90.73 | 84.91 | 99.57
BiRNN | 13.07 | 48.11 | 34.91 | 78.48 | 51.95 | 89.03 | 79.71 | 99.12
Table 6. Ablation experimental results of the Seq2Seq generation task.
Model | HITS@1 (%) | HITS@3 (%) | HITS@5 (%) | HITS@10 (%)
MRBERT | 22.94 | 45.90 | 57.42 | 71.97
w/o cross-attn. | 22.61 | 45.24 | 56.75 | 71.18
w/o separate embed. | 22.15 | 43.46 | 55.12 | 70.17
BiRNN | 19.70 | 39.96 | 51.50 | 66.51