Multi-channel speech enhancement and amplitude modulation analysis for noise robust automatic...

S Alharbi, M Alrazgan, A Alrashed, T Alnomasi… - Ieee …, 2021 - ieeexplore.ieee.org

A huge amount of research has been done in the field of speech signal processing in recent
years. In particular, there has been increasing interest in the automatic speech recognition …

被引用次数：90 相关文章所有 4 个版本

Feature extraction methods in language identification: a survey

D Deshwal, P Sangwan, D Kumar - Wireless Personal Communications, 2019 - Springer

Abstract Language Identification (LI) is one of the widely emerging field in the areas of
speech processing to accurately identify the language from the data base based on some …

被引用次数：48 相关文章所有 3 个版本

Predicting speech intelligibility with deep neural networks

C Spille, SD Ewert, B Kollmeier, BT Meyer - Computer Speech & Language, 2018 - Elsevier

An accurate objective prediction of human speech intelligibility is of interest for many
applications such as the evaluation of signal processing algorithms. To predict the speech …

被引用次数：93 相关文章所有 2 个版本

[PDF] arxiv.org

MetricGAN+/-: Increasing robustness of noise reduction on unseen data

G Close, T Hain, S Goetze - 2022 30th European Signal …, 2022 - ieeexplore.ieee.org

Training of speech enhancement systems often does not incorporate knowledge of human
perception and thus can lead to unnatural sounding results. Incorporating …

被引用次数：13 相关文章所有 7 个版本

[PDF] frontiersin.org

Att-TasNet: Attending to Encodings in Time-Domain Audio Speech Separation of Noisy, Reverberant Speech Mixtures

W Ravenscroft, S Goetze, T Hain - Frontiers in Signal Processing, 2022 - frontiersin.org

Separation of speech mixtures in noisy and reverberant environments remains a
challenging task for state-of-the-art speech separation systems. Time-domain audio speech …

被引用次数：10 相关文章所有 2 个版本

[PDF] arxiv.org

On data sampling strategies for training neural network speech separation models

W Ravenscroft, S Goetze, T Hain - 2023 31st European Signal …, 2023 - ieeexplore.ieee.org

Speech separation remains an important area of multi-speaker signal processing. Deep
neural network (DNN) models have attained the best performance on many speech …

被引用次数：5 相关文章所有 6 个版本

[PDF] arxiv.org

The effect of spoken language on speech enhancement using self-supervised speech representation loss functions

G Close, T Hain, S Goetze - … of Signal Processing to Audio and …, 2023 - ieeexplore.ieee.org

Recent work in the field of speech enhancement (SE) has involved the use of self-
supervised speech representations (SSSRs) as feature transformations in loss functions …

被引用次数：4 相关文章所有 4 个版本

Multimodal authentication system based on audio-visual data: a review

S Debnath, K Ramalakshmi… - … for Advancement in …, 2022 - ieeexplore.ieee.org

Security has become a major concern in this era of fast-growing applications. In such an
authentication scheme, the multimodal verification system is crucial. An essential …

被引用次数：9 相关文章所有 3 个版本

[PDF] unir.net

Audio-visual automatic speech recognition using PZM, MFCC and statistical analysis

S Debnath, P Roy - 2021 - reunir.unir.net

Audio-Visual Automatic Speech Recognition (AV-ASR) has become the most promising
research area when the audio signal gets corrupted by noise. The main objective of this …

被引用次数：9 相关文章所有 5 个版本

Group Attack Dingo Optimizer for enhancing speech recognition in noisy environments

TNM Kumar, KG Kumar, KT Deepak… - The European Physical …, 2023 - Springer

The speech recognition system has become a vital technology enabling seamless human–
computer interactions, even in noisy public places. To enhance the performance of various …