A tutorial on deep learning for music information retrieval

K Choi, G Fazekas, K Cho, M Sandler - arXiv preprint arXiv:1709.04396, 2017 - arxiv.org
Following their success in Computer Vision and other areas, deep learning techniques have
recently become widely adopted in Music Information Retrieval (MIR) research. However …

Madmom: A new python audio and music signal processing library

S Böck, F Korzeniowski, J Schlüter, F Krebs… - Proceedings of the 24th …, 2016 - dl.acm.org
In this paper, we present madmom, an open-source audio processing and music information
retrieval (MIR) library written in Python. madmom features a concise, NumPy-compatible …

Challenges and opportunities of predicting musical emotions with perceptual and automatized features

EB Lange, K Frieler - Music Perception: An Interdisciplinary …, 2018 - online.ucpress.edu
Music information retrieval (MIR) is a fast-growing research area. One of its aims is to extract
musical characteristics from audio. In this study, we assumed the roles of researchers …

Feature learning for chord recognition: The deep chroma extractor

F Korzeniowski, G Widmer - arXiv preprint arXiv:1612.05065, 2016 - arxiv.org
We explore frame-level audio feature learning for chord recognition using artificial neural
networks. We present the argument that chroma vectors potentially hold enough information …

[PDF][PDF] Multi-Task Learning of Tempo and Beat: Learning One to Improve the Other.

S Böck, MEP Davies, P Knees - ISMIR, 2019 - archives.ismir.net
We propose a multi-task learning approach for simultaneous tempo estimation and beat
tracking of musical audio. The system shows state-of-the-art performance for both tasks on a …

[PDF][PDF] A Single-Step Approach to Musical Tempo Estimation Using a Convolutional Neural Network.

H Schreiber, M Müller - Ismir, 2018 - tagtraum.com
We present a single-step musical tempo estimation system based solely on a convolutional
neural network (CNN). Contrary to existing systems, which typically first identify onsets or …

Imposing higher-level structure in polyphonic music generation using convolutional restricted boltzmann machines and constraints

S Lattner, M Grachten, G Widmer - Journal of Creative Music …, 2018 - search.informit.org
We introduce a method for imposing higher-level structure on generated, polyphonic music.
A Convolutional Restricted Boltzmann Machine (C-RBM) as a generative model is combined …

Designing efficient architectures for modeling temporal features with convolutional neural networks

J Pons, X Serra - … Conference on Acoustics, Speech and Signal …, 2017 - ieeexplore.ieee.org
Many researchers use convolutional neural networks with small rectangular filters for music
(spectrograms) classification. First, we discuss why there is no reason to use this filters setup …

Disentangled multidimensional metric learning for music similarity

J Lee, NJ Bryan, J Salamon, Z Jin… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Music similarity search is useful for a variety of creative tasks such as replacing one music
recording with another recording with a similar" feel", a common task in video editing. For …

Deep ranking: Triplet MatchNet for music metric learning

R Lu, K Wu, Z Duan, C Zhang - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Metric learning for music is an important problem for many music information retrieval (MIR)
applications such as music generation, analysis, retrieval, classification and …