Spatial data augmentation with simulated room impulse responses for sound event localization...

P Srivastava, A Deleforge, A Politis… - arXiv preprint arXiv …, 2022 - arxiv.org

Learning-based methods have become ubiquitous in speaker localization. Existing systems
rely on simulated training sets for the lack of sufficiently large, diverse and annotated real …

被引用次数：9 相关文章所有 19 个版本

[PDF] arxiv.org

Lcanets++: Robust audio classification using multi-layer neural networks with lateral competition

SV Dibbo, JS Moore, GT Kenyon… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org

Audio classification aims at recognizing audio signals, including speech commands or
sound events. However, current audio classifiers are susceptible to perturbations and …

被引用次数：8 相关文章所有 2 个版本

[PDF] eurasip.org

Database of simulated room impulse responses for acoustic sensor networks deployed in complex multi-source acoustic environments

R Glitza, L Becker, A Nelus… - 2023 31st European …, 2023 - ieeexplore.ieee.org

In this work we present a large set of simulated room impulse responses for a multi-room
apartment. The simulated apartment models a real vacation apartment for which a recorded …

被引用次数：6 相关文章所有 2 个版本

[PDF] dcase.community

[PDF][PDF] Sound event localization and detection with pre-trained audio spectrogram transformer and multichannel separation network

R Scheibler, T Komatsu, Y Fujita, M Hentschel - omni (1ch), 2022 - dcase.community

We propose a sound event localization and detection system based on a CNN-Conformer
base network. Our main contribution is to evaluate the use of pre-trained elements in this …

被引用次数：9 相关文章所有 2 个版本

[PDF] hal.science

[PDF][PDF] How to (virtually) train your sound source localizer

P Srivastava, A Deleforge, A Politis… - arXiv preprint arXiv …, 2022 - hal.science

Learning-based methods have become ubiquitous in sound source localization (SSL).
Existing systems rely on simulated training sets for the lack of sufficiently large, diverse and …

被引用次数：5 相关文章所有 14 个版本

[PDF] frontiersin.org

Synthetic data generation techniques for training deep acoustic siren identification networks

S Damiano, B Cramer, A Guntoro… - Frontiers in Signal …, 2024 - frontiersin.org

Acoustic sensing has been widely exploited for the early detection of harmful situations in
urban environments: in particular, several siren identification algorithms based on deep …

被引用次数：1 相关文章

[PDF] arxiv.org

Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection

J Hu, Y Cao, M Wu, Q Kong, F Yang… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org

Environment shifts and conflicts present significant challenges for learning-based sound
event localization and detection (SELD) methods. SELD systems, when trained in particular …

被引用次数：1 相关文章所有 2 个版本

Loss function design for DNN-based sound event localization and detection on low-resource realistic data

Q Wang, J Du, Z Nian, S Niu, L Chai… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

This study focuses on the design of a loss function for a deep neural network (DNN)-based
model with two branches, which is used to solve sound event localization and detection …

被引用次数：4 相关文章

[PDF] arxiv.org

PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

J Hu, Y Cao, M Wu, F Kang, F Yang, W Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

Sound event localization and detection (SELD) has seen substantial advancements through
learning-based methods. These systems, typically trained from scratch on specific datasets …

Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation

X Jiang, C Han, YA Li… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org

In this study, we present a simple multi-channel framework for contrastive learning (MC-
SimCLR) to encode 'what'and 'where'of spatial audios. MC-SimCLR learns joint spectral and …

被引用次数：1 相关文章所有 3 个版本