An overview of noise-robust automatic speech recognition

J Li, L Deng, Y Gong… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org
New waves of consumer-centric applications, such as voice search and voice interaction
with mobile devices and home entertainment systems, increasingly require automatic …

An overview of lead and accompaniment separation in music

Z Rafii, A Liutkus, FR Stöter, SI Mimilakis… - … on Audio, Speech …, 2018 - ieeexplore.ieee.org
Popular music is often composed of an accompaniment and a lead component, the latter
typically consisting of vocals. Filtering such mixtures to extract one or both components has …

Binary and ratio time-frequency masks for robust speech recognition

S Srinivasan, N Roman, DL Wang - Speech Communication, 2006 - Elsevier
A time-varying Wiener filter specifies the ratio of a target signal and a noisy mixture in a local
time-frequency unit. We estimate this ratio using a binaural processor and derive a ratio time …

CASA-based robust speaker identification

X Zhao, Y Shao, DL Wang - IEEE Transactions on Audio …, 2012 - ieeexplore.ieee.org
Conventional speaker recognition systems perform poorly under noisy conditions. Inspired
by auditory perception, computational auditory scene analysis (CASA) typically segregates …

An auditory-based feature for robust speech recognition

Y Shao, Z Jin, DL Wang… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
A conventional automatic speech recognizer does not perform well in the presence of noise,
while human listeners are able to segregate and recognize speech in noisy conditions. We …

Missing-feature approaches in speech recognition

B Raj, RM Stern - IEEE Signal Processing Magazine, 2005 - ieeexplore.ieee.org
In this article we have reviewed a wide variety of techniques based on the identification of
missing spectral features that have proved effective in reducing the error rates of automatic …

Robust speaker identification using auditory features and computational auditory scene analysis

Y Shao, DL Wang - 2008 IEEE international conference on …, 2008 - ieeexplore.ieee.org
The performance of speaker recognition systems drop significantly under noisy conditions.
To improve robustness, we have recently proposed novel auditory features and a robust …

Compressive sensing for missing data imputation in noise robust speech recognition

JF Gemmeke, H Van Hamme… - IEEE Journal of …, 2010 - ieeexplore.ieee.org
An effective way to increase the noise robustness of automatic speech recognition is to label
noisy speech features as either reliable or unreliable (missing), and to replace (impute) the …

A computational auditory scene analysis system for speech segregation and robust speech recognition

Y Shao, S Srinivasan, Z Jin, DL Wang - Computer Speech & Language, 2010 - Elsevier
A conventional automatic speech recognizer does not perform well in the presence of
multiple sound sources, while human listeners are able to segregate and recognize a signal …

Reaching over the gap: A review of efforts to link human and automatic speech recognition research

O Scharenborg - Speech Communication, 2007 - Elsevier
The fields of human speech recognition (HSR) and automatic speech recognition (ASR)
both investigate parts of the speech recognition process and have word recognition as their …