Survey of deep learning paradigms for speech processing

KB Bhangale, M Kothandaraman - Wireless Personal Communications, 2022 - Springer
Over the past decades, a particular focus is given to research on machine learning
techniques for speech processing applications. However, in the past few years, research …

Speech technology progress based on new machine learning paradigm

V Delić, Z Perić, M Sečujski… - Computational …, 2019 - Wiley Online Library
Speech technologies have been developed for decades as a typical signal processing area,
while the last decade has brought a huge progress based on new machine learning …

Robust acoustic scene classification using a multi-spectrogram encoder-decoder framework

L Pham, H Phan, T Nguyen, R Palaniappan… - Digital Signal …, 2021 - Elsevier
This article proposes an encoder-decoder network model for Acoustic Scene Classification
(ASC), the task of identifying the scene of an audio recording from its acoustic signature. We …

Micro-Doppler radar classification of humans and animals in an operational environment

WD Van Eeden, JP De Villiers, RJ Berndt… - Expert Systems with …, 2018 - Elsevier
A combined Gaussian mixture model and hidden Markov model (HMM) is developed to
distinguish between slow moving animal and human targets using mel-cepstrum …

A Robust Framework for Acoustic Scene Classification.

LD Pham, I McLoughlin, H Phan, R Palaniappan - INTERSPEECH, 2019 - isca-archive.org
Acoustic scene classification (ASC) using front-end time-frequency features and back-end
neural network classifiers has demonstrated good performance in recent years. However a …

Small vocabulary isolated-word automatic speech recognition for single-word commands in Arabic spoken

M Obaid, R Hodrob, A Abu Mwais, M Aldababsa - Soft Computing, 2023 - Springer
Research into automated speech recognition (ASR) for the Arabic language has been
steadily increasing due to its potential for great growth. In this paper, we implemented …

MFCC in audio signal processing for voice disorder: a review

MS Sidhu, NAA Latib, KK Sidhu - Multimedia Tools and Applications, 2024 - Springer
Voice Disorder or Dysphonia has caught the attention of audio signal process
engineers and researchers. The efficiency of several feature extraction and classifier …

Time–frequency feature fusion for noise robust audio event classification

I McLoughlin, Z Xie, Y Song, H Phan… - Circuits, Systems, and …, 2020 - Springer
This paper explores the use of three different two-dimensional time–frequency features for
audio event classification with deep neural network back-end classifiers. The evaluations …

Prenatal auditory stimulation induces physiological stress responses in developing embryos and newly hatched chicks

SA Hanafi, I Zulkifli, SK Ramiah, ELT Chung, R Kamil… - Poultry Science, 2023 - Elsevier
Prenatal stress may evoke considerable physiological consequences on the developing
poultry embryos and neonates. The present study aimed to determine prenatal auditory …

A spectral glottal flow model for source-filter separation of speech

O Perrotin, I McLoughlin - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
The estimation of glottal flow from a speech waveform is an essential technique used in
speech analysis and parameterisation. Significant research effort has been addressed at …