SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic Camera
Accurately localizing 3D sound sources and estimating their semantic labels, where the
sources may not be visible, but are assumed to lie on the physical surface of objects in the …
Feature aggregation in joint sound classification and localization neural networks
B Healy, P McNamee, ZN Ahmadabadi - IEEE Access, 2024 - ieeexplore.ieee.org
Current state-of-the-art sound source localization (SSL) deep learning networks lack feature
aggregation within their architecture. Feature aggregation within neural network …
Spatial audio and spatial audio-visual learning
Y He - 2024 - ora.ox.ac.uk
As humans, we extensively depend on multimodal signals to perceive, interact with, and
analyze our surrounding 3D spatial environment, so as to accomplish various complex …