[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

Sound event localization and detection of overlapping sources using convolutional recurrent neural networks

S Adavanne, A Politis, J Nikunen… - IEEE Journal of …, 2018 - ieeexplore.ieee.org
In this paper, we propose a convolutional recurrent neural network for joint sound event
localization and detection (SELD) of multiple overlapping sound events in three-dimensional …

Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network

S Adavanne, A Politis, T Virtanen - 2018 26th European Signal …, 2018 - ieeexplore.ieee.org
This paper proposes a deep neural network for estimating the directions of arrival (DOA) of
multiple sound sources. The proposed stacked convolutional and recurrent neural network …

[HTML][HTML] Deep neural network models of sound localization reveal how perception is adapted to real-world environments

A Francl, JH McDermott - Nature human behaviour, 2022 - nature.com
Mammals localize sounds using information from their two ears. Localization in real-world
conditions is challenging, as echoes provide erroneous information and noises mask parts …

SELD-TCN: Sound event localization & detection via temporal convolutional networks

K Guirguis, C Schorn, A Guntoro… - 2020 28th European …, 2021 - ieeexplore.ieee.org
The understanding of the surrounding environment plays a critical role in autonomous
robotic systems, such as self-driving cars. Extensive research has been carried out …

Audio-visual cross-attention network for robotic speaker tracking

X Qian, Z Wang, J Wang, G Guan… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
Audio-visual signals can be used jointly for robotic perception as they complement each
other. Such multi-modal sensory fusion has a clear advantage, especially under noisy …

Sound localization based on phase difference enhancement using deep neural networks

J Pak, JW Shin - IEEE/ACM Transactions on Audio, Speech …, 2019 - ieeexplore.ieee.org
The performance of most of the classical sound source localization algorithms degrades
seriously in the presence of background noise or reverberation. Recently, deep neural …

First order ambisonics domain spatial augmentation for DNN-based direction of arrival estimation

L Mazzon, Y Koizumi, M Yasuda, N Harada - arXiv preprint arXiv …, 2019 - arxiv.org
In this paper, we propose a novel data augmentation method for training neural networks for
Direction of Arrival (DOA) estimation. This method focuses on expanding the representation …

Sound source distance estimation in diverse and dynamic acoustic conditions

SS Kushwaha, IR Roman, M Fuentes… - 2023 IEEE Workshop …, 2023 - ieeexplore.ieee.org
Localizing a moving sound source in the real world involves determining its direction-of-
arrival (DOA) and distance relative to a microphone. Advancements in DOA estimation have …

Multitask learning of time-frequency CNN for sound source localization

C Pang, H Liu, X Li - IEEE Access, 2019 - ieeexplore.ieee.org
Sound source localization (SSL) is an important technique for many audio processing
systems, such as speech enhancement/recognition and human-robot interaction. Although …