MT3: Multi-task multitrack music transcription

J Gardner, I Simon, E Manilow, C Hawthorne… - arXiv preprint arXiv …, 2021 - arxiv.org
Automatic Music Transcription (AMT), inferring musical notes from raw audio, is a
challenging task at the core of music understanding. Unlike Automatic Speech Recognition …

ASAP: a dataset of aligned scores and performances for piano transcription

F Foscarin, A Mcleod, P Rigaux… - … Society for Music …, 2020 - infoscience.epfl.ch
In this paper we present Aligned Scores and Performances (ASAP): a new dataset of 222
digital musical scores aligned with 1068 performances (more than 92 hours) of Western …

Multitrack music transcription with a time-frequency perceiver

WT Lu, JC Wang, YN Hung - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Multitrack music transcription aims to transcribe a music audio input into the musical notes of
multiple instruments simultaneously. It is a very challenging task that typically requires a …

Mrbert: Pre-training of melody and rhythm for automatic music generation

S Li, Y Sung - Mathematics, 2023 - mdpi.com
Deep learning technology has been extensively studied for its potential in music, notably for
creative music generation research. Traditional music generation approaches based on …

A holistic approach to polyphonic music transcription with neural networks

MA Román, A Pertusa, J Calvo-Zaragoza - arXiv preprint arXiv …, 2019 - arxiv.org
We present a framework based on neural networks to extract music scores directly from
polyphonic audio in an end-to-end fashion. Most previous Automatic Music Transcription …

Non-local musical statistics as guides for audio-to-score piano transcription

K Shibata, E Nakamura, K Yoshii - Information Sciences, 2021 - Elsevier
We present an automatic piano transcription system that converts polyphonic audio
recordings into musical scores. This has been a long-standing problem of music information …

Data representations for audio-to-score monophonic music transcription

MA Román, A Pertusa, J Calvo-Zaragoza - Expert Systems with …, 2020 - Elsevier
This work presents an end-to-end method based on deep neural networks for audio-to-score
music transcription of monophonic excerpts. Unlike existing music transcription methods …

Sheet music transformer: End-to-end optical music recognition beyond monophonic transcription

A Ríos-Vila, J Calvo-Zaragoza, T Paquet - International Conference on …, 2024 - Springer
State-of-the-art end-to-end Optical Music Recognition (OMR) has, to date, primarily been
carried out using monophonic transcription techniques to handle complex score layouts …

vocadito: A dataset of solo vocals with , note, and lyric annotations

RM Bittner, K Pasalo, JJ Bosch… - arXiv preprint arXiv …, 2021 - arxiv.org
To compliment the existing set of datasets, we present a small dataset entitled vocadito,
consisting of 40 short excerpts of monophonic singing, sung in 7 different languages by …

Investigating the perceptual validity of evaluation metrics for automatic piano music transcription

A Ycart, L Liu, E Benetos, M Pearce - Transactions of the …, 2020 - qmro.qmul.ac.uk
Automatic Music Transcription (AMT) is usually evaluated using low-level criteria, typically
by counting the numbers of errors, with equal weighting. Yet, some errors (eg out-of-key …