How to (virtually) train your speaker localizer
Learning-based methods have become ubiquitous in speaker localization. Existing systems
rely on simulated training sets for the lack of sufficiently large, diverse and annotated real …
rely on simulated training sets for the lack of sufficiently large, diverse and annotated real …
Lcanets++: Robust audio classification using multi-layer neural networks with lateral competition
Audio classification aims at recognizing audio signals, including speech commands or
sound events. However, current audio classifiers are susceptible to perturbations and …
sound events. However, current audio classifiers are susceptible to perturbations and …
Database of simulated room impulse responses for acoustic sensor networks deployed in complex multi-source acoustic environments
In this work we present a large set of simulated room impulse responses for a multi-room
apartment. The simulated apartment models a real vacation apartment for which a recorded …
apartment. The simulated apartment models a real vacation apartment for which a recorded …
[PDF][PDF] Sound event localization and detection with pre-trained audio spectrogram transformer and multichannel separation network
We propose a sound event localization and detection system based on a CNN-Conformer
base network. Our main contribution is to evaluate the use of pre-trained elements in this …
base network. Our main contribution is to evaluate the use of pre-trained elements in this …
[PDF][PDF] How to (virtually) train your sound source localizer
P Srivastava, A Deleforge, A Politis… - arXiv preprint arXiv …, 2022 - hal.science
Learning-based methods have become ubiquitous in sound source localization (SSL).
Existing systems rely on simulated training sets for the lack of sufficiently large, diverse and …
Existing systems rely on simulated training sets for the lack of sufficiently large, diverse and …
Synthetic data generation techniques for training deep acoustic siren identification networks
Acoustic sensing has been widely exploited for the early detection of harmful situations in
urban environments: in particular, several siren identification algorithms based on deep …
urban environments: in particular, several siren identification algorithms based on deep …
Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection
Environment shifts and conflicts present significant challenges for learning-based sound
event localization and detection (SELD) methods. SELD systems, when trained in particular …
event localization and detection (SELD) methods. SELD systems, when trained in particular …
Loss function design for DNN-based sound event localization and detection on low-resource realistic data
This study focuses on the design of a loss function for a deep neural network (DNN)-based
model with two branches, which is used to solve sound event localization and detection …
model with two branches, which is used to solve sound event localization and detection …
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Sound event localization and detection (SELD) has seen substantial advancements through
learning-based methods. These systems, typically trained from scratch on specific datasets …
learning-based methods. These systems, typically trained from scratch on specific datasets …
Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation
In this study, we present a simple multi-channel framework for contrastive learning (MC-
SimCLR) to encode 'what'and 'where'of spatial audios. MC-SimCLR learns joint spectral and …
SimCLR) to encode 'what'and 'where'of spatial audios. MC-SimCLR learns joint spectral and …