Audio surveillance: A systematic review

M Crocco, M Cristani, A Trucco, V Murino - ACM Computing Surveys …, 2016 - dl.acm.org
Despite surveillance systems becoming increasingly ubiquitous in our living environment,
automated surveillance, currently based on video sensory modality and machine …

[HTML][HTML] Environmental Sound Classification: A descriptive review of the literature

A Bansal, NK Garg - Intelligent systems with applications, 2022 - Elsevier
Automatic environmental sound classification (ESC) is one of the upcoming areas of
research as most of the traditional studies are focused on speech and music signals …

Comparison of time-frequency representations for environmental sound classification using convolutional neural networks

M Huzaifah - arXiv preprint arXiv:1706.07156, 2017 - arxiv.org
Recent successful applications of convolutional neural networks (CNNs) to audio
classification and speech recognition have motivated the search for better input …

Gammatone cepstral coefficients: Biologically inspired features for non-speech audio classification

X Valero, F Alias - IEEE transactions on multimedia, 2012 - ieeexplore.ieee.org
In the context of non-speech audio recognition and classification for multimedia applications,
it becomes essential to have a set of features able to accurately represent and discriminate …

Histogram of gradients of time–frequency representations for audio scene classification

A Rakotomamonjy, G Gasso - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org
Presents our entry to the Detection and Classification of Acoustic Scenes challenge. The
approach we propose for classifying acoustic scenes is based on transforming the audio …

Environmental sound recognition: A survey

S Chachada, CCJ Kuo - APSIPA Transactions on Signal and …, 2014 - cambridge.org
Although research in audio recognition has traditionally focused on speech and music
signals, the problem of environmental sound recognition (ESR) has received more attention …

Features for content-based audio retrieval

D Mitrović, M Zeppelzauer, C Breiteneder - Advances in computers, 2010 - Elsevier
Today, a large number of audio features exists in audio retrieval for different purposes, such
as automatic speech recognition, music information retrieval, audio segmentation, and …

The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music

JJ Aucouturier, B Defreville, F Pachet - The Journal of the Acoustical …, 2007 - pubs.aip.org
The “bag-of-frames” approach (BOF) to audio pattern recognition represents signals as the
long-term statistical distribution of their local spectral features. This approach has proved …

Environmental audio scene and sound event recognition for autonomous surveillance: A survey and comparative studies

S Chandrakala, SL Jayalakshmi - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Monitoring of human and social activities is becoming increasingly pervasive in our living
environment for public security and safety applications. The recognition of suspicious events …

[PDF][PDF] Background subtraction for automated multisensor surveillance: a comprehensive review

M Cristani, M Farenzena, D Bloisi, V Murino - EURASIP Journal on …, 2010 - Springer
Background subtraction is a widely used operation in the video surveillance, aimed at
separating the expected scene (the background) from the unexpected entities (the …