Speech and Audio Processing: a MATLAB-based approach

KB Bhangale, M Kothandaraman - Wireless Personal Communications, 2022 - Springer

Over the past decades, a particular focus is given to research on machine learning
techniques for speech processing applications. However, in the past few years, research …

被引用次数：93 相关文章所有 5 个版本

[PDF] wiley.com Full View

Speech technology progress based on new machine learning paradigm

V Delić, Z Perić, M Sečujski… - Computational …, 2019 - Wiley Online Library

Speech technologies have been developed for decades as a typical signal processing area,
while the last decade has brought a huge progress based on new machine learning …

被引用次数：100 相关文章所有 12 个版本

[PDF] arxiv.org

Robust acoustic scene classification using a multi-spectrogram encoder-decoder framework

L Pham, H Phan, T Nguyen, R Palaniappan… - Digital Signal …, 2021 - Elsevier

This article proposes an encoder-decoder network model for Acoustic Scene Classification
(ASC), the task of identifying the scene of an audio recording from its acoustic signature. We …

被引用次数：45 相关文章所有 17 个版本

[PDF] sciencedirect.com

Micro-Doppler radar classification of humans and animals in an operational environment

WD Van Eeden, JP De Villiers, RJ Berndt… - Expert Systems with …, 2018 - Elsevier

A combined Gaussian mixture model and hidden Markov model (HMM) is developed to
distinguish between slow moving animal and human targets using mel-cepstrum …

被引用次数：67 相关文章所有 3 个版本

[PDF] isca-archive.org

[PDF][PDF] A Robust Framework for Acoustic Scene Classification.

LD Pham, I McLoughlin, H Phan, R Palaniappan - INTERSPEECH, 2019 - isca-archive.org

Acoustic scene classification (ASC) using front-end timefrequency features and back-end
neural network classifiers has demonstrated good performance in recent years. However a …

被引用次数：33 相关文章所有 8 个版本

Small vocabulary isolated-word automatic speech recognition for single-word commands in Arabic spoken

M Obaid, R Hodrob, A Abu Mwais, M Aldababsa - Soft Computing, 2023 - Springer

Research into automated speech recognition (ASR) for the Arabic language has been
steadily increasing due to its potential for great growth. In this paper, we implemented …

被引用次数：7 相关文章所有 2 个版本

MFCC in audio signal processing for voice disorder: a review

MS Sidhu, NAA Latib, KK Sidhu - Multimedia Tools and Applications, 2024 - Springer

Abstract Voice Disorder or Dysphonia has caught the attention of audio signal process
engineers and researchers. The efficiency of several feature extraction and classifier …

被引用次数：8 相关文章

[PDF] springer.com

Time–frequency feature fusion for noise robust audio event classification

I McLoughlin, Z Xie, Y Song, H Phan… - Circuits, Systems, and …, 2020 - Springer

This paper explores the use of three different two-dimensional time–frequency features for
audio event classification with deep neural network back-end classifiers. The evaluations …

被引用次数：24 相关文章所有 11 个版本

[HTML] sciencedirect.com

[HTML][HTML] Prenatal auditory stimulation induces physiological stress responses in developing embryos and newly hatched chicks

SA Hanafi, I Zulkifli, SK Ramiah, ELT Chung, R Kamil… - Poultry Science, 2023 - Elsevier

Prenatal stress may evoke considerable physiological consequences on the developing
poultry embryos and neonates. The present study aimed to determine prenatal auditory …

被引用次数：7 相关文章所有 9 个版本

[PDF] arxiv.org

A spectral glottal flow model for source-filter separation of speech

O Perrotin, I McLoughlin - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org

The estimation of glottal flow from a speech waveform is an essential technique used in
speech analysis and parameterisation. Significant research effort has been addressed at …

被引用次数：28 相关文章所有 8 个版本