MERT: Acoustic music understanding model with large-scale self-supervised training

Y Li, R Yuan, G Zhang, Y Ma, X Chen, H Yin… - arXiv preprint arXiv …, 2023 - arxiv.org
Self-supervised learning (SSL) has recently emerged as a promising paradigm for training
generalisable models on large-scale data in the fields of vision, text, and speech. Although …

Codified audio language modeling learns useful representations for music information retrieval

R Castellon, C Donahue, P Liang - arXiv preprint arXiv:2107.05677, 2021 - arxiv.org
We demonstrate that language models pre-trained on codified (discretely-encoded) music
audio learn representations that are useful for downstream MIR tasks. Specifically, we …

[PDF][PDF] AIST Dance Video Database: Multi-Genre, Multi-Dancer, and Multi-Camera Database for Dance Information Processing.

S Tsuchida, S Fukayama, M Hamasaki, M Goto - ISMIR, 2019 - academia.edu
DB), a shared database containing original street dance videos with copyright-cleared
dance music. Although dancing is highly related to dance music and dance information can …

[图书][B] An introduction to audio content analysis: Music Information Retrieval tasks and applications

A Lerch - 2022 - books.google.com
An Introduction to Audio Content Analysis Enables readers to understand the algorithmic
analysis of musical audio signals with AI-driven approaches An Introduction to Audio …

[PDF][PDF] Deconstruct, Analyse, Reconstruct: How to improve Tempo, Beat, and Downbeat Estimation.

S Böck, MEP Davies - ISMIR, 2020 - program.ismir2020.net
In this paper, we undertake a critical assessment of a stateof-the-art deep neural network
approach for computational rhythm analysis. Our methodology is to deconstruct this …

Supervised and unsupervised learning of audio representations for music understanding

MC McCallum, F Korzeniowski, S Oramas… - arXiv preprint arXiv …, 2022 - arxiv.org
In this work, we provide a broad comparative analysis of strategies for pre-training audio
understanding models for several tasks in the music domain, including labelling of genre …

[PDF][PDF] Multi-Task Learning of Tempo and Beat: Learning One to Improve the Other.

S Böck, MEP Davies, P Knees - ISMIR, 2019 - archives.ismir.net
We propose a multi-task learning approach for simultaneous tempo estimation and beat
tracking of musical audio. The system shows state-of-the-art performance for both tasks on a …

Marble: Music audio representation benchmark for universal evaluation

R Yuan, Y Ma, Y Li, G Zhang, X Chen… - Advances in …, 2023 - proceedings.neurips.cc
In the era of extensive intersection between art and Artificial Intelligence (AI), such as image
generation and fiction co-creation, AI for music remains relatively nascent, particularly in …

[PDF][PDF] A Single-Step Approach to Musical Tempo Estimation Using a Convolutional Neural Network.

H Schreiber, M Müller - Ismir, 2018 - tagtraum.com
We present a single-step musical tempo estimation system based solely on a convolutional
neural network (CNN). Contrary to existing systems, which typically first identify onsets or …

On the effectiveness of speech self-supervised learning for music

Y Ma, R Yuan, Y Li, G Zhang, X Chen, H Yin… - arXiv preprint arXiv …, 2023 - arxiv.org
Self-supervised learning (SSL) has shown promising results in various speech and natural
language processing applications. However, its efficacy in music information retrieval (MIR) …