Automatic speech recognition: Systematic literature review

S Alharbi, M Alrazgan, A Alrashed, T Alnomasi… - Ieee …, 2021 - ieeexplore.ieee.org
A huge amount of research has been done in the field of speech signal processing in recent
years. In particular, there has been increasing interest in the automatic speech recognition …

Feature extraction methods in language identification: a survey

D Deshwal, P Sangwan, D Kumar - Wireless Personal Communications, 2019 - Springer
Abstract Language Identification (LI) is one of the widely emerging field in the areas of
speech processing to accurately identify the language from the data base based on some …

Predicting speech intelligibility with deep neural networks

C Spille, SD Ewert, B Kollmeier, BT Meyer - Computer Speech & Language, 2018 - Elsevier
An accurate objective prediction of human speech intelligibility is of interest for many
applications such as the evaluation of signal processing algorithms. To predict the speech …

MetricGAN+/-: Increasing robustness of noise reduction on unseen data

G Close, T Hain, S Goetze - 2022 30th European Signal …, 2022 - ieeexplore.ieee.org
Training of speech enhancement systems often does not incorporate knowledge of human
perception and thus can lead to unnatural sounding results. Incorporating …

Att-TasNet: Attending to Encodings in Time-Domain Audio Speech Separation of Noisy, Reverberant Speech Mixtures

W Ravenscroft, S Goetze, T Hain - Frontiers in Signal Processing, 2022 - frontiersin.org
Separation of speech mixtures in noisy and reverberant environments remains a
challenging task for state-of-the-art speech separation systems. Time-domain audio speech …

On data sampling strategies for training neural network speech separation models

W Ravenscroft, S Goetze, T Hain - 2023 31st European Signal …, 2023 - ieeexplore.ieee.org
Speech separation remains an important area of multi-speaker signal processing. Deep
neural network (DNN) models have attained the best performance on many speech …

The effect of spoken language on speech enhancement using self-supervised speech representation loss functions

G Close, T Hain, S Goetze - … of Signal Processing to Audio and …, 2023 - ieeexplore.ieee.org
Recent work in the field of speech enhancement (SE) has involved the use of self-
supervised speech representations (SSSRs) as feature transformations in loss functions …

Multimodal authentication system based on audio-visual data: a review

S Debnath, K Ramalakshmi… - … for Advancement in …, 2022 - ieeexplore.ieee.org
Security has become a major concern in this era of fast-growing applications. In such an
authentication scheme, the multimodal verification system is crucial. An essential …

Audio-visual automatic speech recognition using PZM, MFCC and statistical analysis

S Debnath, P Roy - 2021 - reunir.unir.net
Audio-Visual Automatic Speech Recognition (AV-ASR) has become the most promising
research area when the audio signal gets corrupted by noise. The main objective of this …

Group Attack Dingo Optimizer for enhancing speech recognition in noisy environments

TNM Kumar, KG Kumar, KT Deepak… - The European Physical …, 2023 - Springer
The speech recognition system has become a vital technology enabling seamless human–
computer interactions, even in noisy public places. To enhance the performance of various …