Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

P Ochieng - Artificial Intelligence Review, 2023 - Springer
Deep neural networks (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …

Raw waveform-based speech enhancement by fully convolutional networks

SW Fu, Y Tsao, X Lu, H Kawai - 2017 Asia-Pacific Signal and …, 2017 - ieeexplore.ieee.org
This study proposes a fully convolutional network (FCN) model for raw waveform-based
speech enhancement. The proposed system performs speech enhancement in an end-to …

Alternative objective functions for deep clustering

ZQ Wang, J Le Roux, JR Hershey - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
The recently proposed deep clustering framework represents a significant step towards
solving the cocktail party problem. This study proposes and compares a variety of alternative …

Complex spectrogram enhancement by convolutional neural network with multi-metrics learning

SW Fu, T Hu, Y Tsao, X Lu - 2017 IEEE 27th international …, 2017 - ieeexplore.ieee.org
This paper aims to address two issues existing in the current speech enhancement methods:
1) the difficulty of phase estimations; 2) a single objective function cannot consider multiple …

End-to-end speech separation with unfolded iterative phase reconstruction

ZQ Wang, JL Roux, DL Wang, JR Hershey - arXiv preprint arXiv …, 2018 - arxiv.org
This paper proposes an end-to-end approach for single-channel speaker-independent multi-
speaker speech separation, where time-frequency (TF) masking, the short-time Fourier …

Expediting TTS synthesis with adversarial vocoding

P Neekhara, C Donahue, M Puckette, S Dubnov… - arXiv preprint arXiv …, 2019 - arxiv.org
Recent approaches in text-to-speech (TTS) synthesis employ neural network strategies to
vocode perceptually-informed spectrogram representations directly into listenable …

[PDF][PDF] On the Influence of Modifying Magnitude and Phase Spectrum to Enhance Noisy Speech Signals.

HG Hirsch, M Gref - Interspeech, 2017 - isca-archive.org
Neural networks have proven their ability to be usefully applied as component of a speech
enhancement system. This is based on the known feature of neural nets to map regions …

Mismatch problem in deep-learning based speech enhancement

N Hou - 2022 - dr.ntu.edu.sg
Speech enhancement aims to suppress background noise in noisy speech signals in order
to improve speech perceptual quality and intelligibility. For tasks utilizing deep learning …

[PDF][PDF] Multi-Metrics Learning for Speech Enhancement

SW Fu, T Hu, Y Tsao, X Lu - 2016 - researchgate.net
(DNN) based speech enhancement method that aims to enhance magnitude and phase
components of speech signals simultaneously. The novelty of the proposed method is two …

[PDF][PDF] I. EARNED DEGREES

CHUI LEE - 2016 - chl.ece.gatech.edu
COLLEGE OF ENGINEERING Page 1 -1- CHIN-HUI LEE PROFESSOR SCHOOL OF
ELECTRICAL AND COMPUTER ENGINEERING JANUARY 2018 I. EARNED DEGREES • Ph.D …