Two data sets for tempo estimation and key detection in electronic dance music annotated...

Y Li, R Yuan, G Zhang, Y Ma, X Chen, H Yin… - arXiv preprint arXiv …, 2023 - arxiv.org

Self-supervised learning (SSL) has recently emerged as a promising paradigm for training
generalisable models on large-scale data in the fields of vision, text, and speech. Although …

被引用次数：47 相关文章所有 6 个版本

[PDF] arxiv.org

Codified audio language modeling learns useful representations for music information retrieval

R Castellon, C Donahue, P Liang - arXiv preprint arXiv:2107.05677, 2021 - arxiv.org

We demonstrate that language models pre-trained on codified (discretely-encoded) music
audio learn representations that are useful for downstream MIR tasks. Specifically, we …

被引用次数：90 相关文章所有 3 个版本

[PDF] academia.edu

[PDF][PDF] AIST Dance Video Database: Multi-Genre, Multi-Dancer, and Multi-Camera Database for Dance Information Processing.

S Tsuchida, S Fukayama, M Hamasaki, M Goto - ISMIR, 2019 - academia.edu

DB), a shared database containing original street dance videos with copyright-cleared
dance music. Although dancing is highly related to dance music and dance information can …

被引用次数：159 相关文章所有 3 个版本

[图书][B] An introduction to audio content analysis: Music Information Retrieval tasks and applications

A Lerch - 2022 - books.google.com

An Introduction to Audio Content Analysis Enables readers to understand the algorithmic
analysis of musical audio signals with AI-driven approaches An Introduction to Audio …

被引用次数：448 相关文章所有 7 个版本

[PDF] ismir2020.net

[PDF][PDF] Deconstruct, Analyse, Reconstruct: How to improve Tempo, Beat, and Downbeat Estimation.

S Böck, MEP Davies - ISMIR, 2020 - program.ismir2020.net

In this paper, we undertake a critical assessment of a stateof-the-art deep neural network
approach for computational rhythm analysis. Our methodology is to deconstruct this …

被引用次数：71 相关文章所有 2 个版本

[PDF] arxiv.org

Supervised and unsupervised learning of audio representations for music understanding

MC McCallum, F Korzeniowski, S Oramas… - arXiv preprint arXiv …, 2022 - arxiv.org

In this work, we provide a broad comparative analysis of strategies for pre-training audio
understanding models for several tasks in the music domain, including labelling of genre …

被引用次数：32 相关文章所有 4 个版本

[PDF] ismir.net

[PDF][PDF] Multi-Task Learning of Tempo and Beat: Learning One to Improve the Other.

S Böck, MEP Davies, P Knees - ISMIR, 2019 - archives.ismir.net

We propose a multi-task learning approach for simultaneous tempo estimation and beat
tracking of musical audio. The system shows state-of-the-art performance for both tasks on a …

被引用次数：74 相关文章所有 2 个版本

[PDF] neurips.cc

Marble: Music audio representation benchmark for universal evaluation

R Yuan, Y Ma, Y Li, G Zhang, X Chen… - Advances in …, 2023 - proceedings.neurips.cc

In the era of extensive intersection between art and Artificial Intelligence (AI), such as image
generation and fiction co-creation, AI for music remains relatively nascent, particularly in …

被引用次数：9 相关文章所有 7 个版本

[PDF] tagtraum.com

[PDF][PDF] A Single-Step Approach to Musical Tempo Estimation Using a Convolutional Neural Network.

H Schreiber, M Müller - Ismir, 2018 - tagtraum.com

We present a single-step musical tempo estimation system based solely on a convolutional
neural network (CNN). Contrary to existing systems, which typically first identify onsets or …

被引用次数：72 相关文章所有 7 个版本

[PDF] arxiv.org

On the effectiveness of speech self-supervised learning for music

Y Ma, R Yuan, Y Li, G Zhang, X Chen, H Yin… - arXiv preprint arXiv …, 2023 - arxiv.org

Self-supervised learning (SSL) has shown promising results in various speech and natural
language processing applications. However, its efficacy in music information retrieval (MIR) …

被引用次数：8 相关文章所有 12 个版本