Speaker identification through artificial intelligence techniques: A comprehensive review and research challenges
Speech is a powerful medium of communication that always convey rich and useful
information, such as gender, accent, and other unique characteristics of a speaker. These …
information, such as gender, accent, and other unique characteristics of a speaker. These …
Content-driven music recommendation: Evolution, state of the art, and challenges
The music domain is among the most important ones for adopting recommender systems
technology. In contrast to most other recommendation domains, which predominantly rely on …
technology. In contrast to most other recommendation domains, which predominantly rely on …
Deep convolutional neural networks and data augmentation for environmental sound classification
The ability of deep convolutional neural networks (CNNs) to learn discriminative spectro-
temporal patterns makes them well suited to environmental sound classification. However …
temporal patterns makes them well suited to environmental sound classification. However …
[PDF][PDF] librosa: Audio and music signal analysis in python.
This document describes version 0.4. 0 of librosa: a Python package for audio and music
signal processing. At a high level, librosa provides implementations of a variety of common …
signal processing. At a high level, librosa provides implementations of a variety of common …
Internet of musical things: Vision and challenges
The Internet of Musical Things (IoMusT) is an emerging research field positioned at the
intersection of Internet of Things, new interfaces for musical expression, ubiquitous music …
intersection of Internet of Things, new interfaces for musical expression, ubiquitous music …
Contrastive learning of musical representations
J Spijkervet, JA Burgoyne - arXiv preprint arXiv:2103.09410, 2021 - arxiv.org
While deep learning has enabled great advances in many areas of music, labeled music
datasets remain especially hard, expensive, and time-consuming to create. In this work, we …
datasets remain especially hard, expensive, and time-consuming to create. In this work, we …
Uncertainty and surprise jointly predict musical pleasure and amygdala, hippocampus, and auditory cortex activity
Listening to music often evokes intense emotions [1, 2]. Recent research suggests that
musical pleasure comes from positive reward prediction errors, which arise when what is …
musical pleasure comes from positive reward prediction errors, which arise when what is …
Audio features for music emotion recognition: a survey
The design of meaningful audio features is a key need to advance the state-of-the-art in
music emotion recognition (MER). This article presents a survey on the existing emotionally …
music emotion recognition (MER). This article presents a survey on the existing emotionally …
Freesound datasets: a platform for the creation of open audio datasets
Openly available datasets are a key factor in the advancement of data-driven research
approaches, including many of the ones used in sound and music computing. In the last few …
approaches, including many of the ones used in sound and music computing. In the last few …
Scaper: A library for soundscape synthesis and augmentation
J Salamon, D MacConnell, M Cartwright… - … IEEE Workshop on …, 2017 - ieeexplore.ieee.org
Sound event detection (SED) in environmental recordings is a key topic of research in
machine listening, with applications in noise monitoring for smart cities, self-driving cars …
machine listening, with applications in noise monitoring for smart cities, self-driving cars …