Survey of deep learning paradigms for speech processing
KB Bhangale, M Kothandaraman - Wireless Personal Communications, 2022 - Springer
Over the past decades, a particular focus is given to research on machine learning
techniques for speech processing applications. However, in the past few years, research …
techniques for speech processing applications. However, in the past few years, research …
Speech technology progress based on new machine learning paradigm
Speech technologies have been developed for decades as a typical signal processing area,
while the last decade has brought a huge progress based on new machine learning …
while the last decade has brought a huge progress based on new machine learning …
Robust acoustic scene classification using a multi-spectrogram encoder-decoder framework
This article proposes an encoder-decoder network model for Acoustic Scene Classification
(ASC), the task of identifying the scene of an audio recording from its acoustic signature. We …
(ASC), the task of identifying the scene of an audio recording from its acoustic signature. We …
Micro-Doppler radar classification of humans and animals in an operational environment
WD Van Eeden, JP De Villiers, RJ Berndt… - Expert Systems with …, 2018 - Elsevier
A combined Gaussian mixture model and hidden Markov model (HMM) is developed to
distinguish between slow moving animal and human targets using mel-cepstrum …
distinguish between slow moving animal and human targets using mel-cepstrum …
[PDF][PDF] A Robust Framework for Acoustic Scene Classification.
Acoustic scene classification (ASC) using front-end timefrequency features and back-end
neural network classifiers has demonstrated good performance in recent years. However a …
neural network classifiers has demonstrated good performance in recent years. However a …
Small vocabulary isolated-word automatic speech recognition for single-word commands in Arabic spoken
Research into automated speech recognition (ASR) for the Arabic language has been
steadily increasing due to its potential for great growth. In this paper, we implemented …
steadily increasing due to its potential for great growth. In this paper, we implemented …
MFCC in audio signal processing for voice disorder: a review
Abstract Voice Disorder or Dysphonia has caught the attention of audio signal process
engineers and researchers. The efficiency of several feature extraction and classifier …
engineers and researchers. The efficiency of several feature extraction and classifier …
Time–frequency feature fusion for noise robust audio event classification
I McLoughlin, Z Xie, Y Song, H Phan… - Circuits, Systems, and …, 2020 - Springer
This paper explores the use of three different two-dimensional time–frequency features for
audio event classification with deep neural network back-end classifiers. The evaluations …
audio event classification with deep neural network back-end classifiers. The evaluations …
[HTML][HTML] Prenatal auditory stimulation induces physiological stress responses in developing embryos and newly hatched chicks
Prenatal stress may evoke considerable physiological consequences on the developing
poultry embryos and neonates. The present study aimed to determine prenatal auditory …
poultry embryos and neonates. The present study aimed to determine prenatal auditory …
A spectral glottal flow model for source-filter separation of speech
O Perrotin, I McLoughlin - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
The estimation of glottal flow from a speech waveform is an essential technique used in
speech analysis and parameterisation. Significant research effort has been addressed at …
speech analysis and parameterisation. Significant research effort has been addressed at …