Revising Perceptual Linear Prediction (PLP).

A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds

F Alías, JC Socoró, X Sevillano - Applied Sciences, 2016 - mdpi.com

Endowing machines with sensing capabilities similar to those of humans is a prevalent
quest in engineering and computer science. In the pursuit of making computers sense their …

Cited by 307 · Related articles · All 11 versions

On the design of automatic voice condition analysis systems. Part I: Review of concepts and an insight to the state of the art

JA Gómez-García, L Moro-Velázquez… - … Signal Processing and …, 2019 - Elsevier

This is the first of a two-part series devoted to review the current state of the art of automatic
voice condition analysis systems. The goal of this paper is to provide to the scientific …

Cited by 69 · Related articles · All 2 versions

[PDF] Feature extraction methods LPC, PLP and MFCC in speech recognition

N Dave - International journal for advance research in …, 2013 - academia.edu

The automatic recognition of speech, enabling a natural and easy to use method of
communication between human and machine, is an active area of research. Speech …

Cited by 546 · Related articles · All 3 versions

End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition

D Palaz, M Magimai-Doss, R Collobert - Speech Communication, 2019 - Elsevier

In hidden Markov model (HMM) based automatic speech recognition (ASR) system,
modeling the statistical relationship between the acoustic speech signal and the HMM states …

Cited by 174 · Related articles · All 9 versions

ASRTest: automated testing for deep-neural-network-driven speech recognition systems

P Ji, Y Feng, J Liu, Z Zhao, Z Chen - Proceedings of the 31st ACM …, 2022 - dl.acm.org

With the rapid development of deep neural networks and end-to-end learning techniques,
automatic speech recognition (ASR) systems have been deployed into our daily and assist …

Cited by 19 · Related articles

Multitaper MFCC and PLP features for speaker verification using i-vectors

MJ Alam, T Kinnunen, P Kenny, P Ouellet… - Speech …, 2013 - Elsevier

In this paper we study the performance of the low-variance multi-taper Mel-frequency
cepstral coefficient (MFCC) and perceptual linear prediction (PLP) features in a state-of-the …

Cited by 120 · Related articles · All 7 versions

[BOOK] Analysis of speech of people with Parkinson's disease

JR Orozco-Arroyave - 2016 - books.google.com

The analysis of speech of people with Parkinson's disease is an interesting and highly
relevant topic that has attracted the research community during several years. The advances …

Cited by 66 · Related articles

A review on voice pathology: Taxonomy, diagnosis, medical procedures and detection techniques, open challenges, limitations, and recommendations for future …

NQ Abdulmajeed, B Al-Khateeb… - Journal of Intelligent …, 2022 - degruyter.com

Speech is a primary means of human communication and one of the most basic features of
human conduct. Voice is an important part of its subsystems. A speech disorder is a …

Cited by 23 · Related articles · All 6 versions

In domain training data augmentation on noise robust Punjabi Children speech recognition

V Kadyan, P Bawa, T Hasija - Journal of Ambient Intelligence and …, 2022 - Springer

For building a successful automatic speech recognition (ASR) engine large training data is
required. It increases training complexity and become impossible for less resource language …

Cited by 18 · Related articles · All 2 versions

On the design of automatic voice condition analysis systems. Part III: Review of acoustic modelling strategies

JA Gómez-García, L Moro-Velázquez… - … Signal Processing and …, 2021 - Elsevier

This is the third of a three-part series devoted to review the current state of the art of
automatic voice condition analysis systems. A direct continuation to “On the design of …

Cited by 27 · Related articles · All 2 versions