Museformer: Transformer with fine-and coarse-grained attention for music generation

A survey on deep learning for symbolic music generation: Representations, algorithms, evaluations, and challenges

S Ji, X Yang, J Luo - ACM Computing Surveys, 2023 - dl.acm.org

Significant progress has been made in symbolic music generation with the help of deep
learning techniques. However, the tasks covered by symbolic music generation have not …

被引用次数：50 相关文章

[PDF] arxiv.org

Sparks of large audio models: A survey and outlook

S Latif, M Shoukat, F Shamshad, M Usama… - arXiv preprint arXiv …, 2023 - arxiv.org

This survey paper provides a comprehensive overview of the recent advancements and
challenges in applying large language models to the field of audio signal processing. Audio …

被引用次数：19 相关文章所有 4 个版本

[PDF] arxiv.org

Musecoco: Generating symbolic music from text

P Lu, X Xu, C Kang, B Yu, C Xing, X Tan… - arXiv preprint arXiv …, 2023 - arxiv.org

Generating music from text descriptions is a user-friendly mode since the text is a relatively
easy interface for user engagement. While some approaches utilize texts to control music …

被引用次数：32 相关文章所有 4 个版本

[PDF] arxiv.org

Musicagent: An ai agent for music understanding and generation with large language models

D Yu, K Song, P Lu, T He, X Tan, W Ye, S Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

AI-empowered music processing is a diverse field that encompasses dozens of tasks,
ranging from generation tasks (eg, timbre synthesis) to comprehension tasks (eg, music …

被引用次数：10 相关文章所有 3 个版本

[PDF] arxiv.org

Foundation models for music: A survey

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arXiv preprint arXiv …, 2024 - arxiv.org

In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

Pit: Optimization of dynamic sparse deep learning models via permutation invariant transformation

N Zheng, H Jiang, Q Zhang, Z Han, L Ma… - Proceedings of the 29th …, 2023 - dl.acm.org

Dynamic sparsity, where the sparsity patterns are unknown until runtime, poses a significant
challenge to deep learning. The state-of-the-art sparsity-aware deep learning solutions are …

被引用次数：13 相关文章所有 3 个版本

[PDF] arxiv.org

MelodyGLM: multi-task pre-training for symbolic melody generation

X Wu, Z Huang, K Zhang, J Yu, X Tan, T Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Pre-trained language models have achieved impressive results in various music
understanding and generation tasks. However, existing pre-training methods for symbolic …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

WuYun: exploring hierarchical skeleton-guided melody generation using knowledge-enhanced deep learning

K Zhang, X Wu, T Zhang, Z Huang, X Tan… - arXiv preprint arXiv …, 2023 - arxiv.org

Although deep learning has revolutionized music generation, existing methods for structured
melody generation follow an end-to-end left-to-right note-by-note generative paradigm and …

被引用次数：9 相关文章所有 2 个版本

[PDF] arxiv.org

A survey on artificial intelligence for music generation: Agents, domains and perspectives

C Hernandez-Olivan, J Hernandez-Olivan… - arXiv preprint arXiv …, 2022 - arxiv.org

Music is one of the Gardner's intelligences in his theory of multiple intelligences. How
humans perceive and understand music is still being studied and is crucial to develop …

被引用次数：11 相关文章所有 2 个版本

[PDF] thecvf.com

Diff-BGM: A Diffusion Model for Video Background Music Generation

S Li, Y Qin, M Zheng, X Jin… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

When editing a video a piece of attractive background music is indispensable. However
video background music generation tasks face several challenges for example the lack of …

被引用次数：1 相关文章所有 4 个版本