MERT: Acoustic music understanding model with large-scale self-supervised training
Self-supervised learning (SSL) has recently emerged as a promising paradigm for training
generalisable models on large-scale data in the fields of vision, text, and speech. Although …
generalisable models on large-scale data in the fields of vision, text, and speech. Although …
Codified audio language modeling learns useful representations for music information retrieval
We demonstrate that language models pre-trained on codified (discretely-encoded) music
audio learn representations that are useful for downstream MIR tasks. Specifically, we …
audio learn representations that are useful for downstream MIR tasks. Specifically, we …
[PDF][PDF] AIST Dance Video Database: Multi-Genre, Multi-Dancer, and Multi-Camera Database for Dance Information Processing.
DB), a shared database containing original street dance videos with copyright-cleared
dance music. Although dancing is highly related to dance music and dance information can …
dance music. Although dancing is highly related to dance music and dance information can …
[图书][B] An introduction to audio content analysis: Music Information Retrieval tasks and applications
A Lerch - 2022 - books.google.com
An Introduction to Audio Content Analysis Enables readers to understand the algorithmic
analysis of musical audio signals with AI-driven approaches An Introduction to Audio …
analysis of musical audio signals with AI-driven approaches An Introduction to Audio …
[PDF][PDF] Deconstruct, Analyse, Reconstruct: How to improve Tempo, Beat, and Downbeat Estimation.
S Böck, MEP Davies - ISMIR, 2020 - program.ismir2020.net
In this paper, we undertake a critical assessment of a stateof-the-art deep neural network
approach for computational rhythm analysis. Our methodology is to deconstruct this …
approach for computational rhythm analysis. Our methodology is to deconstruct this …
Supervised and unsupervised learning of audio representations for music understanding
In this work, we provide a broad comparative analysis of strategies for pre-training audio
understanding models for several tasks in the music domain, including labelling of genre …
understanding models for several tasks in the music domain, including labelling of genre …
[PDF][PDF] Multi-Task Learning of Tempo and Beat: Learning One to Improve the Other.
We propose a multi-task learning approach for simultaneous tempo estimation and beat
tracking of musical audio. The system shows state-of-the-art performance for both tasks on a …
tracking of musical audio. The system shows state-of-the-art performance for both tasks on a …
Marble: Music audio representation benchmark for universal evaluation
In the era of extensive intersection between art and Artificial Intelligence (AI), such as image
generation and fiction co-creation, AI for music remains relatively nascent, particularly in …
generation and fiction co-creation, AI for music remains relatively nascent, particularly in …
[PDF][PDF] A Single-Step Approach to Musical Tempo Estimation Using a Convolutional Neural Network.
H Schreiber, M Müller - Ismir, 2018 - tagtraum.com
We present a single-step musical tempo estimation system based solely on a convolutional
neural network (CNN). Contrary to existing systems, which typically first identify onsets or …
neural network (CNN). Contrary to existing systems, which typically first identify onsets or …
On the effectiveness of speech self-supervised learning for music
Self-supervised learning (SSL) has shown promising results in various speech and natural
language processing applications. However, its efficacy in music information retrieval (MIR) …
language processing applications. However, its efficacy in music information retrieval (MIR) …