Sound event detection: A tutorial

A Mesaros, T Heittola, T Virtanen… - IEEE Signal …, 2021 - ieeexplore.ieee.org
Imagine standing on a street corner in the city. With your eyes closed you can hear and
recognize a succession of sounds: cars passing by, people speaking, their footsteps when …

A comprehensive review of polyphonic sound event detection

TK Chan, CS Chin - IEEE Access, 2020 - ieeexplore.ieee.org
One of the most amazing functions of the human auditory system is the ability to detect all
kinds of sound events in the environment. With the technologies and hardware advances …

Strong labeling of sound events using crowdsourced weak labels and annotator competence estimation

I Martín-Morató, A Mesaros - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
Crowdsourcing is a popular tool for collecting large amounts of annotated data, but the
specific format of the strong labels necessary for sound event detection is not easily …

Anomalous sound event detection: A survey of machine learning based methods and applications

Z Mnasri, S Rovetta, F Masulli - Multimedia Tools and Applications, 2022 - Springer
With the development of multi-modal man-machine interaction, audio signal analysis is
gaining importance in a field traditionally dominated by video. In particular, anomalous …

An improved mean teacher based method for large scale weakly labeled semi-supervised sound event detection

X Zheng, Y Song, I McLoughlin, L Liu… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
This paper presents an improved mean teacher (MT) based method for large-scale weakly
labeled semi-supervised sound event detection (SED), by focusing on learning a better …

CNN-transformer with self-attention network for sound event detection

K Wakayama, S Saito - ICASSP 2022-2022 IEEE International …, 2022 - ieeexplore.ieee.org
In sound event detection (SED), the representation ability of deep neural network (DNN)
models must be increased to significantly improve the accuracy or increase the number of …

Multi-dimensional frequency dynamic convolution with confident mean teacher for sound event detection

S Xiao, X Zhang, P Zhang - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Recently, convolutional neural networks (CNNs) have been widely used in sound event
detection (SED). However, traditional convolution is deficient in learning time-frequency …

Sound event detection transformer: An event-based end-to-end model for sound event detection

Z Ye, X Wang, H Liu, Y Qian, R Tao, L Yan… - arXiv preprint arXiv …, 2021 - arxiv.org
Sound event detection (SED) has gained increasing attention with its wide application in
surveillance, video indexing, etc. Existing models in SED mainly generate frame-level …

Cross-referencing self-training network for sound event detection in audio mixtures

S Park, DK Han, M Elhilali - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org
Sound event detection is an important facet of audio tagging that aims to identify sounds of
interest and define both the sound category and time boundaries for each sound event in a …

Self-training for sound event detection in audio mixtures

S Park, A Bellur, DK Han… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Sound event detection (SED) takes on the task of identifying presence of specific sound
events in a complex audio recording. SED has tremendous implications in video analytics …