nnaudio: An on-the-fly gpu audio to spectrogram conversion toolbox using 1d convolutional neural networks

KW Cheuk, H Anderson, K Agres, D Herremans - IEEE Access, 2020 - ieeexplore.ieee.org
In this paper, we present nnAudio, a new neural network-based audio processing framework
with graphics processing unit (GPU) support that leverages 1D convolutional neural …

Audio-based musical version identification: Elements and challenges

F Yesiler, G Doras, RM Bittner… - IEEE Signal …, 2021 - ieeexplore.ieee.org
Creating novel interpretations of existing musical compositions is and has always been an
essential part of musical practice. Before the advent of recorded music, listening to a piece of …

Accurate and scalable version identification using musically-motivated embeddings

F Yesiler, J Serrà, E Gómez - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
The version identification (VI) task deals with the automatic detection of recordings that
correspond to the same underlying musical piece. Despite many efforts, VI is still an open …

Discover: Disentangled music representation learning for cover song identification

J Xun, S Zhang, Y Yang, J Zhu, L Deng… - Proceedings of the 46th …, 2023 - dl.acm.org
In the field of music information retrieval (MIR), cover song identification (CSI) is a
challenging task that aims to identify cover versions of a query song from a massive …

Bytecover: Cover song identification via multi-loss training

X Du, Z Yu, B Zhu, X Chen, Z Ma - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
We present in this paper ByteCover, which is a new feature learning method for cover song
identification (CSI). Byte-Cover is built based on the classical ResNet model, and two major …

Learning a representation for cover song identification using convolutional neural network

Z Yu, X Xu, X Chen, D Yang - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
Cover song identification is a challenging task in the field of Music Information Retrieval
(MIR) due to complex musical variations between query tracks and cover versions. Previous …

CTC-based learning of chroma features for score–audio music retrieval

F Zalkow, M Müller - IEEE/ACM Transactions on Audio, Speech …, 2021 - ieeexplore.ieee.org
This paper deals with a score–audio music retrieval task where the aim is to find relevant
audio recordings of Western classical music, given a short monophonic musical theme in …

A prototypical triplet loss for cover detection

G Doras, G Peeters - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
Automatic cover detection-the task of finding in an audio dataset all covers of a query track-
has long been a challenging theoretical problem in MIR community. It also became a …

Deep learning for audio and music

G Peeters, G Richard - Multi-faceted Deep Learning: Models and Data, 2021 - Springer
This chapter provides an overview of how deep learning techniques can be used for audio
signals. We first review the main DNN architectures, meta-architectures and training …

Combining musical features for cover detection

G Doras, F Yesiler, J Serrà Julià… - … J, Ha Lee J, McFee B …, 2020 - repositori.upf.edu
Recent works have addressed the automatic cover detection problem from a metric learning
perspective. They employ different input representations, aiming to exploit melodic or …