Comparison of techniques for environmental sound recognition

M Crocco, M Cristani, A Trucco, V Murino - ACM Computing Surveys …, 2016 - dl.acm.org

Despite surveillance systems becoming increasingly ubiquitous in our living environment,
automated surveillance, currently based on video sensory modality and machine …

Cited by 309 · Related articles · All 6 versions

[HTML] Environmental Sound Classification: A descriptive review of the literature

A Bansal, NK Garg - Intelligent systems with applications, 2022 - Elsevier

Automatic environmental sound classification (ESC) is one of the upcoming areas of
research as most of the traditional studies are focused on speech and music signals …

Cited by 38 · Related articles

Comparison of time-frequency representations for environmental sound classification using convolutional neural networks

M Huzaifah - arXiv preprint arXiv:1706.07156, 2017 - arxiv.org

Recent successful applications of convolutional neural networks (CNNs) to audio
classification and speech recognition have motivated the search for better input …

Cited by 213 · Related articles · All 2 versions

Gammatone cepstral coefficients: Biologically inspired features for non-speech audio classification

X Valero, F Alias - IEEE transactions on multimedia, 2012 - ieeexplore.ieee.org

In the context of non-speech audio recognition and classification for multimedia applications,
it becomes essential to have a set of features able to accurately represent and discriminate …

Cited by 331 · Related articles · All 7 versions

Histogram of gradients of time–frequency representations for audio scene classification

A Rakotomamonjy, G Gasso - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org

Presents our entry to the Detection and Classification of Acoustic Scenes challenge. The
approach we propose for classifying acoustic scenes is based on transforming the audio …

Cited by 275 · Related articles · All 12 versions

Environmental sound recognition: A survey

S Chachada, CCJ Kuo - APSIPA Transactions on Signal and …, 2014 - cambridge.org

Although research in audio recognition has traditionally focused on speech and music
signals, the problem of environmental sound recognition (ESR) has received more attention …

Cited by 261 · Related articles · All 7 versions

Features for content-based audio retrieval

D Mitrović, M Zeppelzauer, C Breiteneder - Advances in computers, 2010 - Elsevier

Today, a large number of audio features exists in audio retrieval for different purposes, such
as automatic speech recognition, music information retrieval, audio segmentation, and …

Cited by 339 · Related articles · All 11 versions

The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music

JJ Aucouturier, B Defreville, F Pachet - The Journal of the Acoustical …, 2007 - pubs.aip.org

The “bag-of-frames” approach (BOF) to audio pattern recognition represents signals as the
long-term statistical distribution of their local spectral features. This approach has proved …

Cited by 331 · Related articles · All 12 versions

Environmental audio scene and sound event recognition for autonomous surveillance: A survey and comparative studies

S Chandrakala, SL Jayalakshmi - ACM Computing Surveys (CSUR), 2019 - dl.acm.org

Monitoring of human and social activities is becoming increasingly pervasive in our living
environment for public security and safety applications. The recognition of suspicious events …

Cited by 91 · Related articles · All 2 versions

[PDF] Background subtraction for automated multisensor surveillance: a comprehensive review

M Cristani, M Farenzena, D Bloisi, V Murino - EURASIP Journal on …, 2010 - Springer

Background subtraction is a widely used operation in the video surveillance, aimed at
separating the expected scene (the background) from the unexpected entities (the …

Cited by 230 · Related articles · All 24 versions