How to (virtually) train your speaker localizer

P Srivastava, A Deleforge, A Politis… - arXiv preprint arXiv …, 2022 - arxiv.org
Learning-based methods have become ubiquitous in speaker localization. Existing systems
rely on simulated training sets for the lack of sufficiently large, diverse and annotated real …

Lcanets++: Robust audio classification using multi-layer neural networks with lateral competition

SV Dibbo, JS Moore, GT Kenyon… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Audio classification aims at recognizing audio signals, including speech commands or
sound events. However, current audio classifiers are susceptible to perturbations and …

Database of simulated room impulse responses for acoustic sensor networks deployed in complex multi-source acoustic environments

R Glitza, L Becker, A Nelus… - 2023 31st European …, 2023 - ieeexplore.ieee.org
In this work we present a large set of simulated room impulse responses for a multi-room
apartment. The simulated apartment models a real vacation apartment for which a recorded …

[PDF][PDF] Sound event localization and detection with pre-trained audio spectrogram transformer and multichannel separation network

R Scheibler, T Komatsu, Y Fujita, M Hentschel - omni (1ch), 2022 - dcase.community
We propose a sound event localization and detection system based on a CNN-Conformer
base network. Our main contribution is to evaluate the use of pre-trained elements in this …

[PDF][PDF] How to (virtually) train your sound source localizer

P Srivastava, A Deleforge, A Politis… - arXiv preprint arXiv …, 2022 - hal.science
Learning-based methods have become ubiquitous in sound source localization (SSL).
Existing systems rely on simulated training sets for the lack of sufficiently large, diverse and …

Synthetic data generation techniques for training deep acoustic siren identification networks

S Damiano, B Cramer, A Guntoro… - Frontiers in Signal …, 2024 - frontiersin.org
Acoustic sensing has been widely exploited for the early detection of harmful situations in
urban environments: in particular, several siren identification algorithms based on deep …

Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection

J Hu, Y Cao, M Wu, Q Kong, F Yang… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Environment shifts and conflicts present significant challenges for learning-based sound
event localization and detection (SELD) methods. SELD systems, when trained in particular …

Loss function design for DNN-based sound event localization and detection on low-resource realistic data

Q Wang, J Du, Z Nian, S Niu, L Chai… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
This study focuses on the design of a loss function for a deep neural network (DNN)-based
model with two branches, which is used to solve sound event localization and detection …

PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

J Hu, Y Cao, M Wu, F Kang, F Yang, W Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Sound event localization and detection (SELD) has seen substantial advancements through
learning-based methods. These systems, typically trained from scratch on specific datasets …

Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation

X Jiang, C Han, YA Li… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
In this study, we present a simple multi-channel framework for contrastive learning (MC-
SimCLR) to encode 'what'and 'where'of spatial audios. MC-SimCLR learns joint spectral and …