Audio-visual spatial integration and recursive attention for robust sound source localization

Articles citing this work:

[PDF] arxiv.org

Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality

KR Park, HJ Lee, JU Kim - European Conference on Computer Vision, 2025 - Springer

Recent Audio-Visual Question Answering (AVQA) methods rely on complete visual
and audio input to answer questions accurately. However, in real-world scenarios, issues …

Cited by: 1 (7 versions)

[PDF] thecvf.com

Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge

D Kim, SJ Um, S Lee, JU Kim - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

The goal of the multi-sound source localization task is to localize sound sources from the
mixture individually. While recent multi-sound source localization methods have shown …

Cited by: 2 (3 versions)

Enhancing Audio-Visual Question Answering with Missing Modality via Trans-Modal Associative Learning

KR Park, Y Oh, JU Kim - ICASSP 2024-2024 IEEE International …, 2024 - ieeexplore.ieee.org

We present a novel method for Audio-Visual Question Answering (AVQA) in real-world
scenarios where one modality (audio or visual) can be missing. Inspired by human cognitive …

Cited by: 1