Automatic speech recognition: Systematic literature review

S Alharbi, M Alrazgan, A Alrashed, T Alnomasi… - Ieee …, 2021 - ieeexplore.ieee.org
A huge amount of research has been done in the field of speech signal processing in recent
years. In particular, there has been increasing interest in the automatic speech recognition …

End-to-end waveform utterance enhancement for direct evaluation metrics optimization by fully convolutional neural networks

SW Fu, TW Wang, Y Tsao, X Lu… - IEEE/ACM Transactions …, 2018 - ieeexplore.ieee.org
Speech enhancement model is used to map a noisy speech to a clean speech. In the
training stage, an objective function is often adopted to optimize the model parameters …

A cross-entropy-guided measure (CEGM) for assessing speech recognition performance and optimizing DNN-based speech enhancement

L Chai, J Du, QF Liu, CH Lee - IEEE/ACM Transactions on …, 2020 - ieeexplore.ieee.org
A new cross-entropy-guided measure (CEGM) is proposed to indirectly assess accuracies of
automatic speech recognition (ASR) of degraded speech with a speech enhancement front …

A flow-based neural network for time domain speech enhancement

M Strauss, B Edler - ICASSP 2021-2021 IEEE International …, 2021 - ieeexplore.ieee.org
Speech enhancement involves the distinction of a target speech signal from an intrusive
background. Although generative approaches using Variational Autoencoders or Generative …

Automatic speech recognition systems: A survey of discriminative techniques

AP Kaur, A Singh, R Sachdeva… - Multimedia Tools and …, 2023 - search.proquest.com
In the subject of pattern recognition, speech recognition is an important study topic. The
authors give a detailed assessment of voice recognition strategies for several majority …

Single-channel speech enhancement using implicit Wiener filter for high-quality speech communication

RK Jaiswal, SR Yeduri, LR Cenkeramaddi - International Journal of …, 2022 - Springer
Speech enables easy human-to-human communication as well as human-to-machine
interaction. However, the quality of speech degrades due to background noise in the …

Reinforcement learning based speech enhancement for robust speech recognition

YL Shen, CY Huang, SS Wang, Y Tsao… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
Conventional deep neural network (DNN)-based speech enhancement (SE) approaches
aim to minimize the mean square error (MSE) between enhanced speech and clean …

Computational intelligence in processing of speech acoustics: a survey

A Singh, N Kaur, V Kukreja, V Kadyan… - Complex & Intelligent …, 2022 - Springer
Speech recognition of a language is a key area in the field of pattern recognition. This paper
presents a comprehensive survey on the speech recognition techniques for non-Indian and …

An Automatic Speech Recognition System: A systematic review and Future directions

A Gupta, R Kumar, Y Kumar - 2022 4th International …, 2022 - ieeexplore.ieee.org
The process of recognizing a person through his/her speech signals can be termed Speech
recognition and the model used to do this task is known as the speech recognition system. It …

FLGCNN: A novel fully convolutional neural network for end-to-end monaural speech enhancement with utterance-based objective functions

Y Zhu, X Xu, Z Ye - Applied Acoustics, 2020 - Elsevier
This paper proposes a novel fully convolutional neural network (FCN) called FLGCNN to
address the end-to-end speech enhancement in time domain. The proposed FLGCNN is …