Automatic speech recognition: Systematic literature review
S Alharbi, M Alrazgan, A Alrashed, T Alnomasi… - Ieee …, 2021 - ieeexplore.ieee.org
A huge amount of research has been done in the field of speech signal processing in recent
years. In particular, there has been increasing interest in the automatic speech recognition …
years. In particular, there has been increasing interest in the automatic speech recognition …
Feature extraction methods in language identification: a survey
Abstract Language Identification (LI) is one of the widely emerging field in the areas of
speech processing to accurately identify the language from the data base based on some …
speech processing to accurately identify the language from the data base based on some …
Predicting speech intelligibility with deep neural networks
An accurate objective prediction of human speech intelligibility is of interest for many
applications such as the evaluation of signal processing algorithms. To predict the speech …
applications such as the evaluation of signal processing algorithms. To predict the speech …
MetricGAN+/-: Increasing robustness of noise reduction on unseen data
Training of speech enhancement systems often does not incorporate knowledge of human
perception and thus can lead to unnatural sounding results. Incorporating …
perception and thus can lead to unnatural sounding results. Incorporating …
Att-TasNet: Attending to Encodings in Time-Domain Audio Speech Separation of Noisy, Reverberant Speech Mixtures
Separation of speech mixtures in noisy and reverberant environments remains a
challenging task for state-of-the-art speech separation systems. Time-domain audio speech …
challenging task for state-of-the-art speech separation systems. Time-domain audio speech …
On data sampling strategies for training neural network speech separation models
Speech separation remains an important area of multi-speaker signal processing. Deep
neural network (DNN) models have attained the best performance on many speech …
neural network (DNN) models have attained the best performance on many speech …
The effect of spoken language on speech enhancement using self-supervised speech representation loss functions
Recent work in the field of speech enhancement (SE) has involved the use of self-
supervised speech representations (SSSRs) as feature transformations in loss functions …
supervised speech representations (SSSRs) as feature transformations in loss functions …
Multimodal authentication system based on audio-visual data: a review
S Debnath, K Ramalakshmi… - … for Advancement in …, 2022 - ieeexplore.ieee.org
Security has become a major concern in this era of fast-growing applications. In such an
authentication scheme, the multimodal verification system is crucial. An essential …
authentication scheme, the multimodal verification system is crucial. An essential …
Audio-visual automatic speech recognition using PZM, MFCC and statistical analysis
Audio-Visual Automatic Speech Recognition (AV-ASR) has become the most promising
research area when the audio signal gets corrupted by noise. The main objective of this …
research area when the audio signal gets corrupted by noise. The main objective of this …
Group Attack Dingo Optimizer for enhancing speech recognition in noisy environments
The speech recognition system has become a vital technology enabling seamless human–
computer interactions, even in noisy public places. To enhance the performance of various …
computer interactions, even in noisy public places. To enhance the performance of various …