[HTML][HTML] A survey of sound source localization with deep learning methods
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …
localization, with a focus on sound source localization in indoor environments, where …
Sound event localization and detection of overlapping sources using convolutional recurrent neural networks
In this paper, we propose a convolutional recurrent neural network for joint sound event
localization and detection (SELD) of multiple overlapping sound events in three-dimensional …
localization and detection (SELD) of multiple overlapping sound events in three-dimensional …
Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network
This paper proposes a deep neural network for estimating the directions of arrival (DOA) of
multiple sound sources. The proposed stacked convolutional and recurrent neural network …
multiple sound sources. The proposed stacked convolutional and recurrent neural network …
[HTML][HTML] Deep neural network models of sound localization reveal how perception is adapted to real-world environments
A Francl, JH McDermott - Nature human behaviour, 2022 - nature.com
Mammals localize sounds using information from their two ears. Localization in real-world
conditions is challenging, as echoes provide erroneous information and noises mask parts …
conditions is challenging, as echoes provide erroneous information and noises mask parts …
SELD-TCN: Sound event localization & detection via temporal convolutional networks
The understanding of the surrounding environment plays a critical role in autonomous
robotic systems, such as self-driving cars. Extensive research has been carried out …
robotic systems, such as self-driving cars. Extensive research has been carried out …
Audio-visual cross-attention network for robotic speaker tracking
Audio-visual signals can be used jointly for robotic perception as they complement each
other. Such multi-modal sensory fusion has a clear advantage, especially under noisy …
other. Such multi-modal sensory fusion has a clear advantage, especially under noisy …
Sound localization based on phase difference enhancement using deep neural networks
J Pak, JW Shin - IEEE/ACM Transactions on Audio, Speech …, 2019 - ieeexplore.ieee.org
The performance of most of the classical sound source localization algorithms degrades
seriously in the presence of background noise or reverberation. Recently, deep neural …
seriously in the presence of background noise or reverberation. Recently, deep neural …
First order ambisonics domain spatial augmentation for DNN-based direction of arrival estimation
In this paper, we propose a novel data augmentation method for training neural networks for
Direction of Arrival (DOA) estimation. This method focuses on expanding the representation …
Direction of Arrival (DOA) estimation. This method focuses on expanding the representation …
Sound source distance estimation in diverse and dynamic acoustic conditions
Localizing a moving sound source in the real world involves determining its direction-of-
arrival (DOA) and distance relative to a microphone. Advancements in DOA estimation have …
arrival (DOA) and distance relative to a microphone. Advancements in DOA estimation have …
Multitask learning of time-frequency CNN for sound source localization
Sound source localization (SSL) is an important technique for many audio processing
systems, such as speech enhancement/recognition and human-robot interaction. Although …
systems, such as speech enhancement/recognition and human-robot interaction. Although …