Convolutional recurrent neural networks for polyphonic sound event detection

E Cakır, G Parascandolo, T Heittola… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org
Sound events often occur in unstructured environments where they exhibit wide variations in
their frequency content and temporal structure. Convolutional neural networks (CNNs) are …

Recurrent neural networks for polyphonic sound event detection in real life recordings

G Parascandolo, H Huttunen… - 2016 IEEE international …, 2016 - ieeexplore.ieee.org
In this paper we present an approach to polyphonic sound event detection in real life
recordings based on bi-directional long short term memory (BLSTM) recurrent neural …

Metrics for polyphonic sound event detection

A Mesaros, T Heittola, T Virtanen - Applied Sciences, 2016 - mdpi.com
This paper presents and discusses various metrics proposed for evaluation of polyphonic
sound event detection systems used in realistic situations where there are typically multiple …

Polyphonic sound event detection using multi label deep neural networks

E Cakir, T Heittola, H Huttunen… - 2015 international joint …, 2015 - ieeexplore.ieee.org
In this paper, the use of multi label neural networks are proposed for detection of temporally
overlapping sound events in realistic environments. Real-life sound recordings typically …

Adaptive pooling operators for weakly labeled sound event detection

B McFee, J Salamon, JP Bello - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org
Sound event detection (SED) methods are tasked with labeling segments of audio
recordings by the presence of active sound sources. SED is typically posed as a supervised …

Sound event detection in multichannel audio using spatial and harmonic features

S Adavanne, G Parascandolo, P Pertilä… - arXiv preprint arXiv …, 2017 - arxiv.org
In this paper, we propose the use of spatial and harmonic features in combination with long
short term memory (LSTM) recurrent neural network (RNN) for automatic sound event …

Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations

A Mesaros, T Heittola, O Dikmen… - 2015 IEEE international …, 2015 - ieeexplore.ieee.org
Methods for detection of overlapping sound events in audio involve matrix factorization
approaches, often assigning separated components to event classes. We present a method …

[PDF][PDF] Acoustic Event Detection Method Using Semi-Supervised Non-Negative Matrix Factorization with Mixtures of Local Dictionaries.

T Komatsu, T Toizumi, R Kondo, Y Senda - DCASE, 2016 - dcase.community
This paper proposes an acoustic event detection (AED) method using semi-supervised non-
negative matrix factorization (NMF) with a mixture of local dictionaries (MLD). The proposed …

Sound event localization and detection using CRNN on pairs of microphones

F Grondin, J Glass, I Sobieraj, MD Plumbley - arXiv preprint arXiv …, 2019 - arxiv.org
This paper proposes sound event localization and detection methods from multichannel
recording. The proposed system is based on two Convolutional Recurrent Neural Networks …

Acoustic event detection based on non-negative matrix factorization with mixtures of local dictionaries and activation aggregation

T Komatsu, Y Senda, R Kondo - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
This paper proposes a new non-negative matrix factorization (NMF) based acoustic event
detection (AED) method with mixtures of local dictionaries (MLD) and activation aggregation …