Sound Event Bounding Boxes

J Ebbers, FG Germain, G Wichern, JL Roux - arXiv preprint arXiv …, 2024 - arxiv.org
… detection (SED) [1, 2] tasks aim at exhaustively inventorying the sounds in a scene. SED …
of sound events on top of their event class. In mathematical terms, it asks to detect the events ej (…

The Sound of Bounding-Boxes

T Oya, S Iwase, S Morishima - 2022 26th International …, 2022 - ieeexplore.ieee.org
bounding box proposal generator, the bounding box selector, and the audio-visual separator.
The bounding box proposal generator takes an image as input and returns bounding box

Eventness: Object detection on spectrograms for temporal localization of audio events

P Pham, J Li, J Szurley, S Das - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
… Besides classifying the audio event, this component also refines the region bounding box.
The allows for a more accurate bounding box which is derived from a larger view of the event

Objects that sound

R Arandjelovic, A Zisserman - Proceedings of the European …, 2018 - openaccess.thecvf.com
… therefore miss the relevant event, so the ideal nDCG of 1 is highly unlikely to be achievable.
… , to learn object detectors without bounding box annotations in the single visual modality …

Metrics for polyphonic sound event detection

A Mesaros, T Heittola, T Virtanen - Applied Sciences, 2016 - mdpi.com
sounds [11,12,13,14]. A more complex situation deals with detecting sound events in audio
with multiple overlapping soundssound event from the number of concurrent sounds at each …

[HTML][HTML] A safety-oriented framework for sound event detection in driving scenarios

C Castorena, M Cobos, J Lopez-Ballester, FJ Ferri - Applied Acoustics, 2024 - Elsevier
… , taking into account the event classes that are known to be … a fully-convolutional sound
event detection model that was … This comprehensive framework for sound event detection in …

Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes

AS Roman, B Balamurugan, R Pothuganti - arXiv preprint arXiv …, 2024 - arxiv.org
… Each bounding box is encoded into two Gaussian-like vectors, corresponding to the … Note
that the baseline model only supports a maximum of M = 6 bounding boxes per frame (ie i = 1..…

Adaptive pooling operators for weakly labeled sound event detection

B McFee, J Salamon, JP Bello - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org
… We evaluate the proposed methods on three multi-label, sound event detection datasets,
which … for inferring object contours in bounding boxes,” IEEE Trans. Image Process., vol. 23, no. …

Self Training and Ensembling Frequency Dependent Networks with Coarse Prediction Pooling and Sound Event Bounding Boxes

H Nam, D Min, S Choi, I Choi, YH Park - arXiv preprint arXiv:2406.15725, 2024 - arxiv.org
… To tackle sound event detection (SED), we propose frequency de… We used change-detection-based
sound event bounding boxes (… pooling, label filtering, sound event bounding boxes

Mixstyle based Domain Generalization for Sound Event Detection with Heterogeneous Training Data

Y Xiao, H Yin, J Bai, RK Das - arXiv preprint arXiv:2407.03654, 2024 - arxiv.org
… Lastly, we use the sound event bounding boxes method for postprocessing. Our approach
integrates features from bidirectional encoder representations from audio transformers and a …