Ast-sed: An effective sound event detection method based on audio spectrogram transformer

K Li, Y Song, LR Dai, I McLoughlin… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
In this paper, we propose an effective sound event detection (SED) method based on the
audio spectrogram transformer (AST) model, pretrained on the large-scale AudioSet for …

[PDF][PDF] Li USTC team's submission for DCASE 2023 challenge task4a

K Li, P Cai, Y Song - Tech. Rep., DCASE2023 Challenge, 2023 - dcase.community
In this technical report, we present our submissions for DCASE 2023 challenge task4a. We
mainly study how to fine-tune patchout fast spectrogram transformer (PaSST) for sound …

A multi-task learning framework for sound event detection using high-level acoustic characteristics of sounds

T Khandelwal, RK Das - arXiv preprint arXiv:2305.10729, 2023 - arxiv.org
Sound event detection (SED) entails identifying the type of sound and estimating its
temporal boundaries from acoustic signals. These events are uniquely characterized by their …

Post-processing independent evaluation of sound event detection systems

J Ebbers, R Haeb-Umbach, R Serizel - arXiv preprint arXiv:2306.15440, 2023 - arxiv.org
Due to the high variation in the application requirements of sound event detection (SED)
systems, it is not sufficient to evaluate systems only in a single operating mode. Therefore …

[PDF][PDF] Sound Event Detection: A Journey Through DCASE Challenge Series

T Khandelwal, RK Das, ES Chng - APSIPA Transactions on …, 2024 - nowpublishers.com
The sense of hearing is fundamental to human beings, as it allows them to perceive their
surroundings. However, this simple task of recognizing different sounds in complex …

[PDF][PDF] Pepe: Plain efficient pretrained embeddings for sound event detection

Y Wang, H Dinkel, Z Yan, J Zhang, Y Wang - 2023 - dcase.community
This paper is a system description of the XiaoRice team submission to the DCASE 2023
Task 4 challenge. In light of the increasing availability of pretrained audio embedding …

[PDF][PDF] Semi-supervised sound event detection system for DCASE 2023 task4a

X Duo, W Fang, J Li - 2023 - dcase.community
In this technical report, we describe our systems for DCASE 2023 Challenge Task4a. Our
systems are mainly based on Frequency Dynamic Convolutional Recurrent Neural Network …

Improving Audio Spectrogram Transformers for Sound Event Detection Through Multi-Stage Training

F Schmid, P Primus, T Morocutti, J Greif… - arXiv preprint arXiv …, 2024 - arxiv.org
This technical report describes the CP-JKU team's submission for Task 4 Sound Event
Detection with Heterogeneous Training Datasets and Potentially Missing Labels of the …

Auditory Neural Response Inspired Sound Event Detection Based on Spectro-temporal Receptive Field

D Min, H Nam, YH Park - arXiv preprint arXiv:2306.11427, 2023 - arxiv.org
Sound event detection (SED) is one of tasks to automate function by human auditory system
which listens and understands auditory scenes. Therefore, we were inspired to make SED …

[PDF][PDF] DCASE 2023 challenge task4 technical report

M Chen, Y Jin, J Shao, Y Liu, B Peng, J Chen - 2023 - dcase.community
We describe our submitted systems for DCASE2023 Task4 in this technical report: Sound
Event Detection with Weak Labels and Synthetic Soundscapes (Subtask A), and Sound …