Acoustic scene classification: a comprehensive survey

B Ding, T Zhang, C Wang, G Liu, J Liang, R Hu… - Expert Systems with …, 2024 - Elsevier
Acoustic scene classification (ASC) has gained significant interest recently due to its diverse
applications. Various audio signal processing and machine learning methods have been …

Rethinking CNN models for audio classification

K Palanisamy, D Singhania, A Yao - arXiv preprint arXiv:2007.11154, 2020 - arxiv.org
In this paper, we show that ImageNet-Pretrained standard deep CNN models can be used
as strong baseline networks for audio classification. Even though there is a significant …

Comparison of pre-trained CNNs for audio classification using transfer learning

E Tsalera, A Papadakis, M Samarakou - Journal of Sensor and Actuator …, 2021 - mdpi.com
The paper investigates retraining options and the performance of pre-trained Convolutional
Neural Networks (CNNs) for sound classification. CNNs were initially designed for image …

ESResNet: Environmental sound classification based on visual domain models

A Guzhov, F Raue, J Hees… - 2020 25th international …, 2021 - ieeexplore.ieee.org
Environmental Sound Classification (ESC) is an active research area in the audio domain
and has seen a lot of progress in the past years. However, many of the existing approaches …

Masked spectrogram prediction for self-supervised audio pre-training

D Chong, H Wang, P Zhou… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Transformer-based models attain excellent results and generalize well when trained on
sufficient amounts of data. However, constrained by the limited data available in the audio …

SpecAugment++: A hidden space data augmentation method for acoustic scene classification

H Wang, Y Zou, W Wang - arXiv preprint arXiv:2103.16858, 2021 - arxiv.org
In this paper, we present SpecAugment++, a novel data augmentation method for deep
neural networks based acoustic scene classification (ASC). Different from other popular data …

Environment sound classification using an attention-based residual neural network

AM Tripathi, A Mishra - Neurocomputing, 2021 - Elsevier
Complexity of environmental sounds imposes numerous challenges for their classification.
The performance of Environmental Sound Classification (ESC) depends greatly on how …

Improving the performance of automated audio captioning via integrating the acoustic and semantic information

Z Ye, H Wang, D Yang, Y Zou - arXiv preprint arXiv:2110.06100, 2021 - arxiv.org
Automated audio captioning (AAC) has developed rapidly in recent years, involving acoustic
signal processing and natural language processing to generate human-readable sentences …

A deep learning approach for detecting drill bit failures from a small sound dataset

T Tran, NT Pham, J Lundgren - Scientific Reports, 2022 - nature.com
Monitoring the conditions of machines is vital in the manufacturing industry. Early detection
of faulty components in machines for stopping and repairing the failed components can …

Attentional graph convolutional network for structure-aware audiovisual scene classification

L Zhou, Y Zhou, X Qi, J Hu, TL Lam… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Audiovisual scene understanding is a challenging problem due to the unstructured spatial–
temporal relations that exist in the audio signals and spatial layouts of different objects in the …