A sequence matching network for polyphonic sound event localization and detection

TNT Nguyen, DL Jones, WS Gan - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
Polyphonic sound event detection and direction-of-arrival estimation require different input
features from audio signals. While sound event detection mainly relies on time-frequency …

A general network architecture for sound event localization and detection using transfer learning and recurrent neural network

TNT Nguyen, NK Nguyen, H Phan… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Polyphonic sound event detection and localization (SELD) task is challenging because it is
difficult to jointly optimize sound event detection (SED) and direction-of-arrival (DOA) …

SoundSynp: Sound Source Detection from Raw Waveforms with Multi-Scale Synperiodic Filterbanks

Y He, A Markham - International Conference on Artificial …, 2023 - proceedings.mlr.press
We propose synperiodic filter banks, a novel multi-scale learnable filter bank construction
strategy that all filters are synchronized by their rotating periodicity. By synchronizing in a …

[PDF][PDF] Acoustic scene classification using a CNN-SuperVector system trained with auditory and spectrogram image features.

R Hyder, S Ghaffarzadegan, Z Feng, JHL Hansen… - Interspeech, 2017 - researchgate.net
Enabling smart devices to infer about the environment using audio signals has been one of
the several long-standing challenges in machine listening. The availability of public-domain …

[PDF][PDF] Ensemble of Sequence Matching Networks for Dynamic Sound Event Localization, Detection, and Tracking.

TNT Nguyen, DL Jones, WS Gan - DCASE, 2020 - dcase.community
Sound event localization and detection consists of two subtasks which are sound event
detection and direction-of-arrival estimation. While sound event detection mainly relies on …

[PDF][PDF] Sound Event Detection: A Journey Through DCASE Challenge Series

T Khandelwal, RK Das, ES Chng - APSIPA Transactions on …, 2024 - nowpublishers.com
The sense of hearing is fundamental to human beings, as it allows them to perceive their
surroundings. However, this simple task of recognizing different sounds in complex …

Synthetic data generation techniques for training deep acoustic siren identification networks

S Damiano, B Cramer, A Guntoro… - Frontiers in Signal …, 2024 - frontiersin.org
Acoustic sensing has been widely exploited for the early detection of harmful situations in
urban environments: in particular, several siren identification algorithms based on deep …

A dataset for audio-visual sound event detection in movies

R Hebbar, D Bose, K Somandepalli… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Audio event detection is a widely studied field, with applications ranging from self-driving
cars to healthcare. In-the-wild datasets such as Audioset have propelled research in this …

What makes sound event localization and detection difficult? insights from error analysis

TNT Nguyen, KN Watcharasupat, ZJ Lee… - arXiv preprint arXiv …, 2021 - arxiv.org
Sound event localization and detection (SELD) is an emerging research topic that aims to
unify the tasks of sound event detection and direction-of-arrival estimation. As a result, SELD …

Echo-aware adaptation of sound event localization and detection in unknown environments

M Yasuda, Y Ohishi, S Saito - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Our goal is to develop a sound event localization and detection (SELD) system that works
robustly in unknown environments. A SELD system trained on known environment data is …