A novel spectral subtraction scheme for robust speech recognition: spectral subtraction using...

PC Loizou - 2007 - taylorfrancis.com

The first book to provide comprehensive and up-to-date coverage of all major speech
enhancement algorithms proposed in the last two decades, Speech Enhancement: Theory …

Cited by 3598 · Related articles · All 8 versions

Enhancements in automatic Kannada speech recognition system by background noise elimination and alternate acoustic modelling

G Thimmaraja Yadava, HS Jayanna - International Journal of Speech …, 2020 - Springer

In this paper, the improvements in the recently implemented Kannada speech recognition
system are demonstrated in detail. The Kannada automatic speech recognition (ASR) system …

Cited by 26 · Related articles · All 2 versions

Robust features in deep-learning-based speech recognition

V Mitra, H Franco, RM Stern, J Van Hout… - New Era for Robust …, 2017 - Springer

Recent progress in deep learning has revolutionized speech recognition research, with
Deep Neural Networks (DNNs) becoming the new state of the art for acoustic modeling …

Cited by 37 · Related articles · All 8 versions

Speech enhancement by combining spectral subtraction and minimum mean square error-spectrum power estimator based on zero crossing

TG Yadava, HS Jayanna - International Journal of Speech Technology, 2019 - Springer

Speech data collected in an uncontrolled environment need to be processed to build a
robust automatic speech recognition system. In this paper, a method is proposed to process …

Cited by 29 · Related articles · All 3 versions

Voice activity detection using harmonic frequency components in likelihood ratio test

LN Tan, BJ Borgstrom, A Alwan - 2010 IEEE International …, 2010 - ieeexplore.ieee.org

This paper proposes a new statistical model-based likelihood ratio test (LRT) VAD to obtain
reliable speech/non-speech decisions. In the proposed method, the likelihood ratio (LR) is …

Cited by 61 · Related articles · All 13 versions

SASE: Self-Adaptive noise distribution network for Speech Enhancement with Federated Learning using heterogeneous data

Z Lin, B Zeng, H Hu, Y Huang, L Xu, Z Yao - Knowledge-Based Systems, 2023 - Elsevier

Correlation of speech data is ubiquitous in real-world environments. Heterogeneous data
involving different distributions hinder federated learning (FL) significantly, which can lead to …

Cited by 3 · Related articles · All 2 versions

Voice/non-voice detection using phase of zero frequency filtered speech signal

SBS Kumar, KS Rao - Speech Communication, 2016 - Elsevier

Voice/non-voice detection refers to the task of detecting the presence or absence of vocal
folds activity regions in the speech signal. Most of the existing state-of-the-art methods …

Cited by 25 · Related articles · All 3 versions

A pitch based noise estimation technique for robust speech recognition with missing data

JA Morales-Cordovilla, N Ma, V Sánchez… - … , Speech and Signal …, 2011 - ieeexplore.ieee.org

This paper presents a noise estimation technique based on knowledge of pitch information
for robust speech recognition. In the first stage the noise is estimated by means of …

Cited by 38 · Related articles · All 8 versions

[PDF] Creation and comparison of language and acoustic models using Kaldi for noisy and enhanced speech data

TG Yadava, HS Jayanna - International Journal of Intelligent Systems …, 2018 - academia.edu

Acoustic Models (AMs) are developed using the speech recognition toolkit Kaldi for noisy
and enhanced speech data to build an Automatic Speech Recognition (ASR) system for …

Cited by 9 · Related articles · All 5 versions

A modified Ephraim-Malah noise suppression rule for automatic speech recognition

R Gemello, F Mana, R De Mori - 2004 IEEE International …, 2004 - ieeexplore.ieee.org

A soft decision gain modification is introduced and applied to the Ephraim-Malah gain
function based on minimum mean square error (MMSE) estimation (Ephraim, Y. and Malah …

Cited by 30 · Related articles · All 7 versions