nnaudio: An on-the-fly gpu audio to spectrogram conversion toolbox using 1d convolutional neural networks
In this paper, we present nnAudio, a new neural network-based audio processing framework
with graphics processing unit (GPU) support that leverages 1D convolutional neural …
with graphics processing unit (GPU) support that leverages 1D convolutional neural …
Audio-based musical version identification: Elements and challenges
Creating novel interpretations of existing musical compositions is and has always been an
essential part of musical practice. Before the advent of recorded music, listening to a piece of …
essential part of musical practice. Before the advent of recorded music, listening to a piece of …
Accurate and scalable version identification using musically-motivated embeddings
The version identification (VI) task deals with the automatic detection of recordings that
correspond to the same underlying musical piece. Despite many efforts, VI is still an open …
correspond to the same underlying musical piece. Despite many efforts, VI is still an open …
Discover: Disentangled music representation learning for cover song identification
In the field of music information retrieval (MIR), cover song identification (CSI) is a
challenging task that aims to identify cover versions of a query song from a massive …
challenging task that aims to identify cover versions of a query song from a massive …
Bytecover: Cover song identification via multi-loss training
We present in this paper ByteCover, which is a new feature learning method for cover song
identification (CSI). Byte-Cover is built based on the classical ResNet model, and two major …
identification (CSI). Byte-Cover is built based on the classical ResNet model, and two major …
Learning a representation for cover song identification using convolutional neural network
Cover song identification is a challenging task in the field of Music Information Retrieval
(MIR) due to complex musical variations between query tracks and cover versions. Previous …
(MIR) due to complex musical variations between query tracks and cover versions. Previous …
CTC-based learning of chroma features for score–audio music retrieval
This paper deals with a score–audio music retrieval task where the aim is to find relevant
audio recordings of Western classical music, given a short monophonic musical theme in …
audio recordings of Western classical music, given a short monophonic musical theme in …
A prototypical triplet loss for cover detection
Automatic cover detection-the task of finding in an audio dataset all covers of a query track-
has long been a challenging theoretical problem in MIR community. It also became a …
has long been a challenging theoretical problem in MIR community. It also became a …
Deep learning for audio and music
This chapter provides an overview of how deep learning techniques can be used for audio
signals. We first review the main DNN architectures, meta-architectures and training …
signals. We first review the main DNN architectures, meta-architectures and training …
Combining musical features for cover detection
Recent works have addressed the automatic cover detection problem from a metric learning
perspective. They employ different input representations, aiming to exploit melodic or …
perspective. They employ different input representations, aiming to exploit melodic or …