[图书][B] Speech enhancement: theory and practice

PC Loizou - 2007 - taylorfrancis.com
The first book to provide comprehensive and up-to-date coverage of all major speech
enhancement algorithms proposed in the last two decades, Speech Enhancement: Theory …

Enhancements in automatic Kannada speech recognition system by background noise elimination and alternate acoustic modelling

G Thimmaraja Yadava, HS Jayanna - International Journal of Speech …, 2020 - Springer
In this paper, the improvements in the recently implemented Kannada speech recognition
system is demonstrated in detail. The Kannada automatic speech recognition (ASR) system …

Robust features in deep-learning-based speech recognition

V Mitra, H Franco, RM Stern, J Van Hout… - New Era for Robust …, 2017 - Springer
Recent progress in deep learning has revolutionized speech recognition research, with
Deep Neural Networks (DNNs) becoming the new state of the art for acoustic modeling …

Speech enhancement by combining spectral subtraction and minimum mean square error-spectrum power estimator based on zero crossing

TG Yadava, HS Jayanna - International Journal of Speech Technology, 2019 - Springer
Speech data collected under uncontrolled environment need to be processed to build a
robust automatic speech recognition system. In this paper, a method is proposed to process …

Voice activity detection using harmonic frequency components in likelihood ratio test

LN Tan, BJ Borgstrom, A Alwan - 2010 IEEE International …, 2010 - ieeexplore.ieee.org
This paper proposes a new statistical model-based likelihood ratio test (LRT) VAD to obtain
reliable speech/non-speech decisions. In the proposed method, the likelihood ratio (LR) is …

SASE: Self-Adaptive noise distribution network for Speech Enhancement with Federated Learning using heterogeneous data

Z Lin, B Zeng, H Hu, Y Huang, L Xu, Z Yao - Knowledge-Based Systems, 2023 - Elsevier
Correlation of speech data is ubiquitous in real-world environments. Heterogeneous data
involving different distributions hinder federated learning (FL) significantly, which can lead to …

Voice/non-voice detection using phase of zero frequency filtered speech signal

SBS Kumar, KS Rao - Speech Communication, 2016 - Elsevier
Voice/non-voice detection refers to the task of detecting the presence or absence of vocal
folds activity regions in the speech signal. Most of the existing state-of-the-art methods …

A pitch based noise estimation technique for robust speech recognition with missing data

JA Morales-Cordovilla, N Ma, V Sánchez… - … , Speech and Signal …, 2011 - ieeexplore.ieee.org
This paper presents a noise estimation technique based on knowledge of pitch information
for robust speech recognition. In the first stage the noise is estimated by means of …

[PDF][PDF] Creation and comparison of language and acoustic models using Kaldi for noisy and enhanced speech data

TG Yadava, HS Jayanna - International Journal of Intelligent Systems …, 2018 - academia.edu
Acoustic Models (AMs) are developed using the speech recognition toolkit Kaldi for noisy
and enhanced speech data to build an Automatic Speech Recognition (ASR) system for …

A modified Ephraim-Malah noise suppression rule for automatic speech recognition

R Gemello, F Mana, R De Mori - 2004 IEEE International …, 2004 - ieeexplore.ieee.org
A soft decision gain modification is introduced and applied to the Ephraim-Malah gain
function based on maximum mean square error estimation (MMSE)(Ephraim, Y. and Malah …