Time-frequency masking in the complex domain for speech dereverberation and denoising
DS Williamson, DL Wang - IEEE/ACM transactions on audio …, 2017 - ieeexplore.ieee.org
In real-world situations, speech is masked by both background noise and reverberation,
which negatively affect perceptual quality and intelligibility. In this paper, we address …
which negatively affect perceptual quality and intelligibility. In this paper, we address …
Learning spectral mapping for speech dereverberation and denoising
In real-world environments, human speech is usually distorted by both reverberation and
background noise, which have negative effects on speech intelligibility and speech quality …
background noise, which have negative effects on speech intelligibility and speech quality …
Exploiting deep neural networks and head movements for robust binaural localization of multiple sources in reverberant environments
This paper presents a novel machine-hearing system that exploits deep neural networks
(DNNs) and head movements for robust binaural localization of multiple sources in …
(DNNs) and head movements for robust binaural localization of multiple sources in …
Binaural classification for reverberant speech segregation using deep neural networks
Speech signal degradation in real environments mainly results from room reverberation and
concurrent noise. While human listening is robust in complex auditory scenes, current …
concurrent noise. While human listening is robust in complex auditory scenes, current …
Assessing the generalization gap of learning-based speech enhancement systems in noisy and reverberant environments
The acoustic variability of noisy and reverberant speech mixtures is influenced by multiple
factors, such as the spectro-temporal characteristics of the target speaker and the interfering …
factors, such as the spectro-temporal characteristics of the target speaker and the interfering …
End-to-end binaural sound localisation from the raw waveform
P Vecchiotti, N Ma, S Squartini… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
A novel end-to-end binaural sound localisation approach is proposed which estimates the
azimuth of a sound source directly from the waveform. Instead of employing hand-crafted …
azimuth of a sound source directly from the waveform. Instead of employing hand-crafted …
On cross-corpus generalization of deep learning based speech enhancement
In recent years, supervised approaches using deep neural networks (DNNs) have become
the mainstream for speech enhancement. It has been established that DNNs generalize well …
the mainstream for speech enhancement. It has been established that DNNs generalize well …
Binaural localization of multiple sources in reverberant and noisy environments
J Woodruff, DL Wang - IEEE Transactions on Audio, Speech …, 2012 - ieeexplore.ieee.org
Sound source localization from a binaural input is a challenging problem, particularly when
multiple sources are active simultaneously and reverberation or background noise are …
multiple sources are active simultaneously and reverberation or background noise are …
Robust binaural localization of a target sound source by combining spectral source models and deep neural networks
Despite there being a clear evidence for top-down (eg, attentional) effects in biological
spatial hearing, relatively few machine hearing systems exploit the top-down model-based …
spatial hearing, relatively few machine hearing systems exploit the top-down model-based …
Features for masking-based monaural speech separation in reverberant conditions
M Delfarah, DL Wang - IEEE/ACM Transactions on Audio …, 2017 - ieeexplore.ieee.org
Monaural speech separation is a fundamental problem in speech and signal processing.
This problem can be approached from a supervised learning perspective by predicting an …
This problem can be approached from a supervised learning perspective by predicting an …