Time-frequency masking in the complex domain for speech dereverberation and denoising

DS Williamson, DL Wang - IEEE/ACM transactions on audio …, 2017 - ieeexplore.ieee.org
In real-world situations, speech is masked by both background noise and reverberation,
which negatively affect perceptual quality and intelligibility. In this paper, we address …

Learning spectral mapping for speech dereverberation and denoising

K Han, Y Wang, DL Wang, WS Woods… - … on Audio, Speech …, 2015 - ieeexplore.ieee.org
In real-world environments, human speech is usually distorted by both reverberation and
background noise, which have negative effects on speech intelligibility and speech quality …

Exploiting deep neural networks and head movements for robust binaural localization of multiple sources in reverberant environments

N Ma, T May, GJ Brown - IEEE/ACM Transactions on Audio …, 2017 - ieeexplore.ieee.org
This paper presents a novel machine-hearing system that exploits deep neural networks
(DNNs) and head movements for robust binaural localization of multiple sources in …

Binaural classification for reverberant speech segregation using deep neural networks

Y Jiang, DL Wang, RS Liu… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org
Speech signal degradation in real environments mainly results from room reverberation and
concurrent noise. While human listening is robust in complex auditory scenes, current …

Assessing the generalization gap of learning-based speech enhancement systems in noisy and reverberant environments

P Gonzalez, TS Alstrøm, T May - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
The acoustic variability of noisy and reverberant speech mixtures is influenced by multiple
factors, such as the spectro-temporal characteristics of the target speaker and the interfering …

End-to-end binaural sound localisation from the raw waveform

P Vecchiotti, N Ma, S Squartini… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
A novel end-to-end binaural sound localisation approach is proposed which estimates the
azimuth of a sound source directly from the waveform. Instead of employing hand-crafted …

On cross-corpus generalization of deep learning based speech enhancement

A Pandey, DL Wang - IEEE/ACM transactions on audio, speech …, 2020 - ieeexplore.ieee.org
In recent years, supervised approaches using deep neural networks (DNNs) have become
the mainstream for speech enhancement. It has been established that DNNs generalize well …

Binaural localization of multiple sources in reverberant and noisy environments

J Woodruff, DL Wang - IEEE Transactions on Audio, Speech …, 2012 - ieeexplore.ieee.org
Sound source localization from a binaural input is a challenging problem, particularly when
multiple sources are active simultaneously and reverberation or background noise are …

Robust binaural localization of a target sound source by combining spectral source models and deep neural networks

N Ma, JA Gonzalez, GJ Brown - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org
Despite there being a clear evidence for top-down (e.g., attentional) effects in biological
spatial hearing, relatively few machine hearing systems exploit the top-down model-based …

Features for masking-based monaural speech separation in reverberant conditions

M Delfarah, DL Wang - IEEE/ACM Transactions on Audio …, 2017 - ieeexplore.ieee.org
Monaural speech separation is a fundamental problem in speech and signal processing.
This problem can be approached from a supervised learning perspective by predicting an …