Srp-dnn: Learning direct-path phase difference for multiple moving sound source localization

E Grinstein, E Tengan, B Çakmak, T Dietzen… - EURASIP Journal on …, 2024 - Springer

In the last three decades, the Steered Response Power (SRP) method has been widely
used for the task of Sound Source Localization (SSL), due to its satisfactory localization …

被引用次数：2 相关文章所有 2 个版本

[PDF] nature.com

Creating speech zones with self-distributing acoustic swarms

M Itani, T Chen, T Yoshioka, S Gollakota - Nature Communications, 2023 - nature.com

Imagine being in a crowded room with a cacophony of speakers and having the ability to
focus on or remove speech from a specific 2D region. This would require understanding and …

被引用次数：8 相关文章所有 10 个版本

[PDF] arxiv.org

Configurable doa estimation using incremental learning

Y Xiao, RK Das - arXiv preprint arXiv:2407.03661, 2024 - arxiv.org

This study introduces a progressive neural network (PNN) model for direction of arrival
(DOA) estimation, DOA-PNN, addressing the challenge due to catastrophic forgetting in …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

A hybrid neural coding approach for pattern recognition with spiking neural networks

X Chen, Q Yang, J Wu, H Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Recently, brain-inspired spiking neural networks (SNNs) have demonstrated promising
capabilities in solving pattern recognition tasks. However, these SNNs are grounded on …

被引用次数：19 相关文章所有 6 个版本

[PDF] arxiv.org

RealMAN: A real-recorded and annotated microphone array dataset for dynamic speech enhancement and localization

B Yang, C Quan, Y Wang, P Wang, Y Yang… - arXiv preprint arXiv …, 2024 - arxiv.org

The training of deep learning-based multichannel speech enhancement and source
localization systems relies heavily on the simulation of room impulse response and …

被引用次数：3 相关文章所有 4 个版本

[PDF] arxiv.org

Tf-mamba: A time-frequency network for sound source localization

Y Xiao, RK Das - arXiv preprint arXiv:2409.05034, 2024 - arxiv.org

Sound source localization (SSL) determines the position of sound sources using multi-
channel audio data. It is commonly used to improve speech enhancement and separation …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

FN-SSL: Full-band and narrow-band fusion for sound source localization

Y Wang, B Yang, X Li - arXiv preprint arXiv:2305.19610, 2023 - arxiv.org

Extracting direct-path spatial features is critical for sound source localization in adverse
acoustic environments. This paper proposes a full-band and narrow-band fusion network for …

被引用次数：11 相关文章所有 4 个版本

[PDF] arxiv.org

BAT: Learning to Reason about Spatial Sounds with Large Language Models

Z Zheng, P Peng, Z Ma, X Chen, E Choi… - arXiv preprint arXiv …, 2024 - arxiv.org

Spatial sound reasoning is a fundamental human skill, enabling us to navigate and interpret
our surroundings based on sound. In this paper we present BAT, which combines the spatial …

被引用次数：9 相关文章所有 3 个版本

IFAN: An Icosahedral Feature Attention Network for Sound Source Localization

XC Zhu, H Zhang, HT Feng, DH Zhao… - IEEE Transactions …, 2024 - ieeexplore.ieee.org

Currently, sound source localization (SSL) techniques based on deep learning mainly rely
on traditional signal processing methods to generate input features. Nevertheless, the …

被引用次数：3 相关文章所有 2 个版本

[PDF] openreview.net

Spike-based Neuromorphic Model for Sound Source Localization

D Zhang, S Wang, A Belatreche, W Wei… - The Thirty-eighth …, 2024 - openreview.net

Biological systems possess remarkable sound source localization (SSL) capabilities that are
critical for survival in complex environments. This ability arises from the collaboration …

被引用次数：1 相关文章