SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic Camera

Y He, S Shin, A Cherian, A Markham - arXiv preprint arXiv:2412.16861, 2024 - arxiv.org
Accurately localizing 3D sound sources and estimating their semantic labels--where the
sources may not be visible, but are assumed to lie on the physical surface of objects in the …

Feature aggregation in joint sound classification and localization neural networks

B Healy, P McNamee, ZN Ahmadabadi - IEEE Access, 2024 - ieeexplore.ieee.org
Current state-of-the-art sound source localization (SSL) deep learning networks lack feature
aggregation within their architecture. Feature aggregation within neural network …

Spatial audio and spatial audio-visual learning

Y He - 2024 - ora.ox.ac.uk
As humans, we extensively depend on multimodal signals to perceive, interact with, and
analyze our surrounding 3D spatial environment, so as to accomplish various complex …