[图书][B] Speech enhancement: theory and practice
PC Loizou - 2007 - taylorfrancis.com
The first book to provide comprehensive and up-to-date coverage of all major speech
enhancement algorithms proposed in the last two decades, Speech Enhancement: Theory …
enhancement algorithms proposed in the last two decades, Speech Enhancement: Theory …
Enhancements in automatic Kannada speech recognition system by background noise elimination and alternate acoustic modelling
G Thimmaraja Yadava, HS Jayanna - International Journal of Speech …, 2020 - Springer
In this paper, the improvements in the recently implemented Kannada speech recognition
system is demonstrated in detail. The Kannada automatic speech recognition (ASR) system …
system is demonstrated in detail. The Kannada automatic speech recognition (ASR) system …
Robust features in deep-learning-based speech recognition
Recent progress in deep learning has revolutionized speech recognition research, with
Deep Neural Networks (DNNs) becoming the new state of the art for acoustic modeling …
Deep Neural Networks (DNNs) becoming the new state of the art for acoustic modeling …
Speech enhancement by combining spectral subtraction and minimum mean square error-spectrum power estimator based on zero crossing
TG Yadava, HS Jayanna - International Journal of Speech Technology, 2019 - Springer
Speech data collected under uncontrolled environment need to be processed to build a
robust automatic speech recognition system. In this paper, a method is proposed to process …
robust automatic speech recognition system. In this paper, a method is proposed to process …
Voice activity detection using harmonic frequency components in likelihood ratio test
LN Tan, BJ Borgstrom, A Alwan - 2010 IEEE International …, 2010 - ieeexplore.ieee.org
This paper proposes a new statistical model-based likelihood ratio test (LRT) VAD to obtain
reliable speech/non-speech decisions. In the proposed method, the likelihood ratio (LR) is …
reliable speech/non-speech decisions. In the proposed method, the likelihood ratio (LR) is …
SASE: Self-Adaptive noise distribution network for Speech Enhancement with Federated Learning using heterogeneous data
Z Lin, B Zeng, H Hu, Y Huang, L Xu, Z Yao - Knowledge-Based Systems, 2023 - Elsevier
Correlation of speech data is ubiquitous in real-world environments. Heterogeneous data
involving different distributions hinder federated learning (FL) significantly, which can lead to …
involving different distributions hinder federated learning (FL) significantly, which can lead to …
Voice/non-voice detection using phase of zero frequency filtered speech signal
Voice/non-voice detection refers to the task of detecting the presence or absence of vocal
folds activity regions in the speech signal. Most of the existing state-of-the-art methods …
folds activity regions in the speech signal. Most of the existing state-of-the-art methods …
A pitch based noise estimation technique for robust speech recognition with missing data
This paper presents a noise estimation technique based on knowledge of pitch information
for robust speech recognition. In the first stage the noise is estimated by means of …
for robust speech recognition. In the first stage the noise is estimated by means of …
[PDF][PDF] Creation and comparison of language and acoustic models using Kaldi for noisy and enhanced speech data
TG Yadava, HS Jayanna - International Journal of Intelligent Systems …, 2018 - academia.edu
Acoustic Models (AMs) are developed using the speech recognition toolkit Kaldi for noisy
and enhanced speech data to build an Automatic Speech Recognition (ASR) system for …
and enhanced speech data to build an Automatic Speech Recognition (ASR) system for …
A modified Ephraim-Malah noise suppression rule for automatic speech recognition
A soft decision gain modification is introduced and applied to the Ephraim-Malah gain
function based on maximum mean square error estimation (MMSE)(Ephraim, Y. and Malah …
function based on maximum mean square error estimation (MMSE)(Ephraim, Y. and Malah …