Foundation models for music: A survey
In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …
Retrieval guided music captioning via multimodal prefixes
In this paper we put forward a new approach to music captioning, the task of automatically
generating natural language descriptions for songs. These descriptions are useful both for …
Leveraging Structure and Context for Language-Adjacent Representation Learning
N Srivatsan - 2024 - kilthub.cmu.edu
When learning representations from large corpora of language data, the overwhelming
strategy is to interpret that data as a collection of IID samples to be modeled in isolation from …