Supervised speech separation based on deep learning: An overview
Speech separation is the task of separating target speech from background interference.
Traditionally, speech separation is studied as a signal processing problem. A more recent …
Traditionally, speech separation is studied as a signal processing problem. A more recent …
CASE-Net: Integrating local and non-local attention operations for speech enhancement
Local and non-local attention operations are two ubiquitous operations in the domain of
speech enhancement (SE), and they are effective to generate more discriminative patterns …
speech enhancement (SE), and they are effective to generate more discriminative patterns …
Raw waveform-based speech enhancement by fully convolutional networks
This study proposes a fully convolutional network (FCN) model for raw waveform-based
speech enhancement. The proposed system performs speech enhancement in an end-to …
speech enhancement. The proposed system performs speech enhancement in an end-to …
Gated residual networks with dilated convolutions for monaural speech enhancement
For supervised speech enhancement, contextual information is important for accurate mask
estimation or spectral mapping. However, commonly used deep neural networks (DNNs) are …
estimation or spectral mapping. However, commonly used deep neural networks (DNNs) are …
[PDF][PDF] SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement.
This paper proposes a signal-to-noise-ratio (SNR) aware convolutional neural network
(CNN) model for speech enhancement (SE). Because the CNN model can deal with local …
(CNN) model for speech enhancement (SE). Because the CNN model can deal with local …
LSTM-convolutional-BLSTM encoder-decoder network for minimum mean-square error approach to speech enhancement
Z Wang, T Zhang, Y Shao, B Ding - Applied Acoustics, 2021 - Elsevier
In recent years, deep learning models have been employed for speech enhancement. Most
of the existing methods based on deep learning use fully Convolutional Neural Network …
of the existing methods based on deep learning use fully Convolutional Neural Network …
Single channel speech enhancement using convolutional neural network
T Kounovsky, J Malek - 2017 IEEE International Workshop of …, 2017 - ieeexplore.ieee.org
Neural networks can be used to identify and remove noise from noisy speech spectrum
(denoisisng autoencoders, DAEs). The DAEs are typically implemented using the fully …
(denoisisng autoencoders, DAEs). The DAEs are typically implemented using the fully …
Source separation in ecoacoustics: A roadmap towards versatile soundscape information retrieval
A comprehensive assessment of ecosystem dynamics requires the monitoring of biological,
physical and social changes. Changes that cannot be observed visually may be trackable …
physical and social changes. Changes that cannot be observed visually may be trackable …
Robust direction estimation with convolutional neural networks based steered response power
The steered response power (SRP) methods can be used to build a map of sound direction
likelihood. In the presence of interference and reverberation, the map will exhibit multiple …
likelihood. In the presence of interference and reverberation, the map will exhibit multiple …
Gated residual networks with dilated convolutions for supervised speech separation
In supervised speech separation, deep neural networks (DNNs) are typically employed to
predict an ideal time-frequency (TF) mask in order to remove background interference …
predict an ideal time-frequency (TF) mask in order to remove background interference …