Steered response power for sound source localization: A tutorial review
In the last three decades, the Steered Response Power (SRP) method has been widely
used for the task of Sound Source Localization (SSL), due to its satisfactory localization …
used for the task of Sound Source Localization (SSL), due to its satisfactory localization …
Creating speech zones with self-distributing acoustic swarms
Imagine being in a crowded room with a cacophony of speakers and having the ability to
focus on or remove speech from a specific 2D region. This would require understanding and …
focus on or remove speech from a specific 2D region. This would require understanding and …
Configurable doa estimation using incremental learning
This study introduces a progressive neural network (PNN) model for direction of arrival
(DOA) estimation, DOA-PNN, addressing the challenge due to catastrophic forgetting in …
(DOA) estimation, DOA-PNN, addressing the challenge due to catastrophic forgetting in …
A hybrid neural coding approach for pattern recognition with spiking neural networks
Recently, brain-inspired spiking neural networks (SNNs) have demonstrated promising
capabilities in solving pattern recognition tasks. However, these SNNs are grounded on …
capabilities in solving pattern recognition tasks. However, these SNNs are grounded on …
RealMAN: A real-recorded and annotated microphone array dataset for dynamic speech enhancement and localization
The training of deep learning-based multichannel speech enhancement and source
localization systems relies heavily on the simulation of room impulse response and …
localization systems relies heavily on the simulation of room impulse response and …
Tf-mamba: A time-frequency network for sound source localization
Sound source localization (SSL) determines the position of sound sources using multi-
channel audio data. It is commonly used to improve speech enhancement and separation …
channel audio data. It is commonly used to improve speech enhancement and separation …
FN-SSL: Full-band and narrow-band fusion for sound source localization
Extracting direct-path spatial features is critical for sound source localization in adverse
acoustic environments. This paper proposes a full-band and narrow-band fusion network for …
acoustic environments. This paper proposes a full-band and narrow-band fusion network for …
BAT: Learning to Reason about Spatial Sounds with Large Language Models
Spatial sound reasoning is a fundamental human skill, enabling us to navigate and interpret
our surroundings based on sound. In this paper we present BAT, which combines the spatial …
our surroundings based on sound. In this paper we present BAT, which combines the spatial …
IFAN: An Icosahedral Feature Attention Network for Sound Source Localization
XC Zhu, H Zhang, HT Feng, DH Zhao… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Currently, sound source localization (SSL) techniques based on deep learning mainly rely
on traditional signal processing methods to generate input features. Nevertheless, the …
on traditional signal processing methods to generate input features. Nevertheless, the …
Spike-based Neuromorphic Model for Sound Source Localization
Biological systems possess remarkable sound source localization (SSL) capabilities that are
critical for survival in complex environments. This ability arises from the collaboration …
critical for survival in complex environments. This ability arises from the collaboration …