General-purpose tagging of freesound audio with audioset labels: Task description, dataset,...

E Fonseca, X Favory, J Pons, F Font… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org

Most existing datasets for sound event recognition (SER) are relatively small and/or domain-
specific, with the exception of AudioSet, based on over 2 M tracks from YouTube videos and …

被引用次数：407 相关文章所有 5 个版本

Panns: Large-scale pretrained audio neural networks for audio pattern recognition

Q Kong, Y Cao, T Iqbal, Y Wang… - … on Audio, Speech …, 2020 - ieeexplore.ieee.org

Audio pattern recognition is an important research topic in the machine learning area, and
includes several tasks such as audio tagging, acoustic scene classification, music …

被引用次数：1037 相关文章所有 8 个版本

[PDF] hal.science

The internet of audio things: State of the art, vision, and challenges

L Turchet, G Fazekas, M Lagrange… - IEEE internet of …, 2020 - ieeexplore.ieee.org

The Internet of Audio Things (IoAuT) is an emerging research field positioned at the
intersection of the Internet of Things, sound and music computing, artificial intelligence, and …

被引用次数：64 相关文章所有 8 个版本

[PDF] arxiv.org

Anomalous sound detection with machine learning: A systematic review

EC Nunes - arXiv preprint arXiv:2102.07820, 2021 - arxiv.org

Anomalous sound detection (ASD) is the task of identifying whether the sound emitted from
an object is normal or anomalous. In some cases, early detection of this anomaly can …

被引用次数：30 相关文章所有 2 个版本

[PDF] arxiv.org

Cough against covid: Evidence of covid-19 signature in cough sounds

P Bagad, A Dalmia, J Doshi, A Nagrani… - arXiv preprint arXiv …, 2020 - arxiv.org

Testing capacity for COVID-19 remains a challenge globally due to the lack of adequate
supplies, trained personnel, and sample-processing equipment. These problems are even …

被引用次数：134 相关文章所有 7 个版本

[PDF] ieee.org

Strong labeling of sound events using crowdsourced weak labels and annotator competence estimation

I Martín-Morató, A Mesaros - IEEE/ACM transactions on audio …, 2023 - ieeexplore.ieee.org

Crowdsourcing is a popular tool for collecting large amounts of annotated data, but the
specific format of the strong labels necessary for sound event detection is not easily …

被引用次数：32 相关文章所有 5 个版本

[PDF] arxiv.org

Polyphonic sound event detection and localization using a two-stage strategy

Y Cao, Q Kong, T Iqbal, F An, W Wang… - arXiv preprint arXiv …, 2019 - arxiv.org

Sound event detection (SED) and localization refer to recognizing sound events and
estimating their spatial and temporal locations. Using neural networks has become the …

被引用次数：122 相关文章所有 13 个版本

[PDF] arxiv.org

Learning sound event classifiers from web audio with noisy labels

E Fonseca, M Plakal, DPW Ellis, F Font… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

As sound event classification moves towards larger datasets, issues of label noise become
inevitable. Web sites can supply large volumes of user-contributed audio and metadata, but …

被引用次数：126 相关文章所有 8 个版本

[PDF] arxiv.org

Unsupervised contrastive learning of sound event representations

E Fonseca, D Ortego, K McGuinness… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

Self-supervised representation learning can mitigate the limitations in recognition tasks with
few manually labeled data but abundant unlabeled data—a common scenario in sound …

被引用次数：71 相关文章所有 8 个版本

[PDF] arxiv.org

Sound event detection and time–frequency segmentation from weakly labelled data

Q Kong, Y Xu, I Sobieraj, W Wang… - … /ACM Transactions on …, 2019 - ieeexplore.ieee.org

Sound event detection (SED) aims to detect when and recognize what sound events happen
in an audio clip. Many supervised SED algorithms rely on strongly labelled data that …

被引用次数：122 相关文章所有 6 个版本