Supervised speech separation based on deep learning: An overview

DL Wang, J Chen - IEEE/ACM transactions on audio, speech …, 2018 - ieeexplore.ieee.org
Speech separation is the task of separating target speech from background interference.
Traditionally, speech separation is studied as a signal processing problem. A more recent …

CASE-Net: Integrating local and non-local attention operations for speech enhancement

X Xu, W Tu, Y Yang - Speech Communication, 2023 - Elsevier
Local and non-local attention operations are two ubiquitous operations in the domain of
speech enhancement (SE), and they are effective to generate more discriminative patterns …

Raw waveform-based speech enhancement by fully convolutional networks

SW Fu, Y Tsao, X Lu, H Kawai - 2017 Asia-Pacific Signal and …, 2017 - ieeexplore.ieee.org
This study proposes a fully convolutional network (FCN) model for raw waveform-based
speech enhancement. The proposed system performs speech enhancement in an end-to …

Gated residual networks with dilated convolutions for monaural speech enhancement

K Tan, J Chen, DL Wang - IEEE/ACM transactions on audio …, 2018 - ieeexplore.ieee.org
For supervised speech enhancement, contextual information is important for accurate mask
estimation or spectral mapping. However, commonly used deep neural networks (DNNs) are …

[PDF][PDF] SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement.

SW Fu, Y Tsao, X Lu - Interspeech, 2016 - academia.edu
This paper proposes a signal-to-noise-ratio (SNR) aware convolutional neural network
(CNN) model for speech enhancement (SE). Because the CNN model can deal with local …

LSTM-convolutional-BLSTM encoder-decoder network for minimum mean-square error approach to speech enhancement

Z Wang, T Zhang, Y Shao, B Ding - Applied Acoustics, 2021 - Elsevier
In recent years, deep learning models have been employed for speech enhancement. Most
of the existing methods based on deep learning use fully Convolutional Neural Network …

Single channel speech enhancement using convolutional neural network

T Kounovsky, J Malek - 2017 IEEE International Workshop of …, 2017 - ieeexplore.ieee.org
Neural networks can be used to identify and remove noise from noisy speech spectrum
(denoisisng autoencoders, DAEs). The DAEs are typically implemented using the fully …

Source separation in ecoacoustics: A roadmap towards versatile soundscape information retrieval

TH Lin, Y Tsao - Remote Sensing in Ecology and Conservation, 2020 - Wiley Online Library
A comprehensive assessment of ecosystem dynamics requires the monitoring of biological,
physical and social changes. Changes that cannot be observed visually may be trackable …

Robust direction estimation with convolutional neural networks based steered response power

P Pertilä, E Cakir - 2017 IEEE International Conference on …, 2017 - ieeexplore.ieee.org
The steered response power (SRP) methods can be used to build a map of sound direction
likelihood. In the presence of interference and reverberation, the map will exhibit multiple …

Gated residual networks with dilated convolutions for supervised speech separation

K Tan, J Chen, DL Wang - 2018 IEEE International Conference …, 2018 - ieeexplore.ieee.org
In supervised speech separation, deep neural networks (DNNs) are typically employed to
predict an ideal time-frequency (TF) mask in order to remove background interference …