A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds

F Alías, JC Socoró, X Sevillano - Applied Sciences, 2016 - mdpi.com
Endowing machines with sensing capabilities similar to those of humans is a prevalent
quest in engineering and computer science. In the pursuit of making computers sense their …

On the design of automatic voice condition analysis systems. Part I: Review of concepts and an insight to the state of the art

JA Gómez-García, L Moro-Velázquez… - … Signal Processing and …, 2019 - Elsevier
This is the first of a two-part series devoted to review the current state of the art of automatic
voice condition analysis systems. The goal of this paper is to provide to the scientific …

[PDF][PDF] Feature extraction methods LPC, PLP and MFCC in speech recognition

N Dave - International journal for advance research in …, 2013 - academia.edu
The automatic recognition of speech, enabling a natural and easy to use method of
communication between human and machine, is an active area of research. Speech …

End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition

D Palaz, M Magimai-Doss, R Collobert - Speech Communication, 2019 - Elsevier
In hidden Markov model (HMM) based automatic speech recognition (ASR) system,
modeling the statistical relationship between the acoustic speech signal and the HMM states …

ASRTest: automated testing for deep-neural-network-driven speech recognition systems

P Ji, Y Feng, J Liu, Z Zhao, Z Chen - Proceedings of the 31st ACM …, 2022 - dl.acm.org
With the rapid development of deep neural networks and end-to-end learning techniques,
automatic speech recognition (ASR) systems have been deployed into our daily and assist …

Multitaper MFCC and PLP features for speaker verification using i-vectors

MJ Alam, T Kinnunen, P Kenny, P Ouellet… - Speech …, 2013 - Elsevier
In this paper we study the performance of the low-variance multi-taper Mel-frequency
cepstral coefficient (MFCC) and perceptual linear prediction (PLP) features in a state-of-the …

[图书][B] Analysis of speech of people with Parkinson's disease

JR Orozco-Arroyave - 2016 - books.google.com
The analysis of speech of people with Parkinson's disease is an interesting and highly
relevant topic that has attracted the research community during several years. The advances …

A review on voice pathology: Taxonomy, diagnosis, medical procedures and detection techniques, open challenges, limitations, and recommendations for future …

NQ Abdulmajeed, B Al-Khateeb… - Journal of Intelligent …, 2022 - degruyter.com
Speech is a primary means of human communication and one of the most basic features of
human conduct. Voice is an important part of its subsystems. A speech disorder is a …

In domain training data augmentation on noise robust Punjabi Children speech recognition

V Kadyan, P Bawa, T Hasija - Journal of Ambient Intelligence and …, 2022 - Springer
For building a successful automatic speech recognition (ASR) engine large training data is
required. It increases training complexity and become impossible for less resource language …

On the design of automatic voice condition analysis systems. Part III: Review of acoustic modelling strategies

JA Gómez-García, L Moro-Velázquez… - … Signal Processing and …, 2021 - Elsevier
This is the third of a three-part series devoted to review the current state of the art of
automatic voice condition analysis systems. A direct continuation to “On the design of …