A survey on deep learning for symbolic music generation: Representations, algorithms, evaluations, and challenges

S Ji, X Yang, J Luo - ACM Computing Surveys, 2023 - dl.acm.org
Significant progress has been made in symbolic music generation with the help of deep
learning techniques. However, the tasks covered by symbolic music generation have not …

Sparks of large audio models: A survey and outlook

S Latif, M Shoukat, F Shamshad, M Usama… - arXiv preprint arXiv …, 2023 - arxiv.org
This survey paper provides a comprehensive overview of the recent advancements and
challenges in applying large language models to the field of audio signal processing. Audio …

Musecoco: Generating symbolic music from text

P Lu, X Xu, C Kang, B Yu, C Xing, X Tan… - arXiv preprint arXiv …, 2023 - arxiv.org
Generating music from text descriptions is a user-friendly mode since the text is a relatively
easy interface for user engagement. While some approaches utilize texts to control music …

Musicagent: An ai agent for music understanding and generation with large language models

D Yu, K Song, P Lu, T He, X Tan, W Ye, S Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI-empowered music processing is a diverse field that encompasses dozens of tasks,
ranging from generation tasks (eg, timbre synthesis) to comprehension tasks (eg, music …

Foundation models for music: A survey

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

Pit: Optimization of dynamic sparse deep learning models via permutation invariant transformation

N Zheng, H Jiang, Q Zhang, Z Han, L Ma… - Proceedings of the 29th …, 2023 - dl.acm.org
Dynamic sparsity, where the sparsity patterns are unknown until runtime, poses a significant
challenge to deep learning. The state-of-the-art sparsity-aware deep learning solutions are …

MelodyGLM: multi-task pre-training for symbolic melody generation

X Wu, Z Huang, K Zhang, J Yu, X Tan, T Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Pre-trained language models have achieved impressive results in various music
understanding and generation tasks. However, existing pre-training methods for symbolic …

WuYun: exploring hierarchical skeleton-guided melody generation using knowledge-enhanced deep learning

K Zhang, X Wu, T Zhang, Z Huang, X Tan… - arXiv preprint arXiv …, 2023 - arxiv.org
Although deep learning has revolutionized music generation, existing methods for structured
melody generation follow an end-to-end left-to-right note-by-note generative paradigm and …

A survey on artificial intelligence for music generation: Agents, domains and perspectives

C Hernandez-Olivan, J Hernandez-Olivan… - arXiv preprint arXiv …, 2022 - arxiv.org
Music is one of the Gardner's intelligences in his theory of multiple intelligences. How
humans perceive and understand music is still being studied and is crucial to develop …

Diff-BGM: A Diffusion Model for Video Background Music Generation

S Li, Y Qin, M Zheng, X Jin… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
When editing a video a piece of attractive background music is indispensable. However
video background music generation tasks face several challenges for example the lack of …