Timbre analysis of music audio signals with convolutional neural networks

J Pons, O Slizovskaia, R Gong… - 2017 25th European …, 2017 - ieeexplore.ieee.org
The focus of this work is to study how to efficiently tailor Convolutional Neural Networks
(CNNs) towards learning timbre representations from log-mel magnitude spectrograms. We …

End-to-end learning for music audio tagging at scale

J Pons, O Nieto, M Prockup, E Schmidt… - arXiv preprint arXiv …, 2017 - arxiv.org
The lack of data tends to limit the outcomes of deep learning research, particularly when
dealing with end-to-end learning stacks processing raw data such as waveforms. In this …

Sample-level deep convolutional neural networks for music auto-tagging using raw waveforms

J Lee, J Park, KL Kim, J Nam - arXiv preprint arXiv:1703.01789, 2017 - arxiv.org
Recently, the end-to-end approach that learns hierarchical representations from raw data
using deep convolutional neural networks has been successfully explored in the image, text …

SampleCNN: End-to-end deep convolutional neural networks using very small filters for music classification

J Lee, J Park, KL Kim, J Nam - Applied Sciences, 2018 - mdpi.com
Convolutional Neural Networks (CNN) have been applied to diverse machine learning tasks
for different modalities of raw data in an end-to-end fashion. In the audio domain, a raw …

Multimodal deep learning for music genre classification

S Oramas, F Barbieri, O Nieto Caballero… - Transactions of the …, 2018 - repositori.upf.edu
Music genre labels are useful to organize songs, albums, and artists into broader groups
that share similar musical characteristics. In this work, an approach to learn and combine …

The receptive field as a regularizer in deep convolutional neural networks for acoustic scene classification

K Koutini, H Eghbal-Zadeh, M Dorfer… - 2019 27th European …, 2019 - ieeexplore.ieee.org
Convolutional Neural Networks (CNNs) have had great success in many machine vision as
well as machine audition tasks. Many image recognition network architectures have …

Multi-label music genre classification from audio, text, and images using deep features

S Oramas, O Nieto, F Barbieri, X Serra - arXiv preprint arXiv:1707.04916, 2017 - arxiv.org
Music genres allow to categorize musical items that share common characteristics. Although
these categories are not mutually exclusive, most related research is traditionally focused on …

Bottom-up broadcast neural network for music genre classification

C Liu, L Feng, G Liu, H Wang, S Liu - Multimedia Tools and Applications, 2021 - Springer
Music genre classification based on visual representation has been successfully explored
over the last years. Recently, there has been increasing interest in attempting convolutional …

musicnn: Pre-trained convolutional neural networks for music audio tagging

J Pons, X Serra - arXiv preprint arXiv:1909.06654, 2019 - arxiv.org
Pronounced as" musician", the musicnn library contains a set of pre-trained musically
motivated convolutional neural networks for music audio tagging: https://github …

Multi-level and multi-scale feature aggregation using pretrained convolutional neural networks for music auto-tagging

J Lee, J Nam - IEEE signal processing letters, 2017 - ieeexplore.ieee.org
Music auto-tagging is often handled in a similar manner to image classification by regarding
the two-dimensional audio spectrogram as image data. However, music auto-tagging is …