Steered response power for sound source localization: A tutorial review

E Grinstein, E Tengan, B Çakmak, T Dietzen… - EURASIP Journal on …, 2024 - Springer
In the last three decades, the Steered Response Power (SRP) method has been widely
used for the task of Sound Source Localization (SSL), due to its satisfactory localization …

Creating speech zones with self-distributing acoustic swarms

M Itani, T Chen, T Yoshioka, S Gollakota - Nature Communications, 2023 - nature.com
Imagine being in a crowded room with a cacophony of speakers and having the ability to
focus on or remove speech from a specific 2D region. This would require understanding and …

Configurable doa estimation using incremental learning

Y Xiao, RK Das - arXiv preprint arXiv:2407.03661, 2024 - arxiv.org
This study introduces a progressive neural network (PNN) model for direction of arrival
(DOA) estimation, DOA-PNN, addressing the challenge due to catastrophic forgetting in …

A hybrid neural coding approach for pattern recognition with spiking neural networks

X Chen, Q Yang, J Wu, H Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Recently, brain-inspired spiking neural networks (SNNs) have demonstrated promising
capabilities in solving pattern recognition tasks. However, these SNNs are grounded on …

RealMAN: A real-recorded and annotated microphone array dataset for dynamic speech enhancement and localization

B Yang, C Quan, Y Wang, P Wang, Y Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
The training of deep learning-based multichannel speech enhancement and source
localization systems relies heavily on the simulation of room impulse response and …

Tf-mamba: A time-frequency network for sound source localization

Y Xiao, RK Das - arXiv preprint arXiv:2409.05034, 2024 - arxiv.org
Sound source localization (SSL) determines the position of sound sources using multi-
channel audio data. It is commonly used to improve speech enhancement and separation …

FN-SSL: Full-band and narrow-band fusion for sound source localization

Y Wang, B Yang, X Li - arXiv preprint arXiv:2305.19610, 2023 - arxiv.org
Extracting direct-path spatial features is critical for sound source localization in adverse
acoustic environments. This paper proposes a full-band and narrow-band fusion network for …

BAT: Learning to Reason about Spatial Sounds with Large Language Models

Z Zheng, P Peng, Z Ma, X Chen, E Choi… - arXiv preprint arXiv …, 2024 - arxiv.org
Spatial sound reasoning is a fundamental human skill, enabling us to navigate and interpret
our surroundings based on sound. In this paper we present BAT, which combines the spatial …

IFAN: An Icosahedral Feature Attention Network for Sound Source Localization

XC Zhu, H Zhang, HT Feng, DH Zhao… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Currently, sound source localization (SSL) techniques based on deep learning mainly rely
on traditional signal processing methods to generate input features. Nevertheless, the …

Spike-based Neuromorphic Model for Sound Source Localization

D Zhang, S Wang, A Belatreche, W Wei… - The Thirty-eighth …, 2024 - openreview.net
Biological systems possess remarkable sound source localization (SSL) capabilities that are
critical for survival in complex environments. This ability arises from the collaboration …