Sixty years of frequency-domain monaural speech enhancement: From traditional to deep learning methods

C Zheng, H Zhang, W Liu, X Luo, A Li, X Li… - Trends in …, 2023 - journals.sagepub.com
Frequency-domain monaural speech enhancement has been extensively studied for over
60 years, and a great number of methods have been proposed and applied to many …

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

P Ochieng - Artificial Intelligence Review, 2023 - Springer
Deep neural networks (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …

A multi-sensor signals denoising framework for tool state monitoring based on UKF-CycleGAN

X Wei, X Liu, C Yue, L Wang, SY Liang, Y Qin - Mechanical Systems and …, 2023 - Elsevier
The denoising of mechanical system is always an indispensable process in sensor signal
analysis. It directly affects the result of subsequent tool state monitoring and identification …

Speech enhancement with neural homomorphic synthesis

W Jiang, Z Liu, K Yu, F Wen - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Most deep learning-based speech enhancement methods operate directly on time-
frequency representations or learned features without making use of the model of speech …

A Two-Stage Approach to Quality Restoration of Bone-Conducted Speech

C Li, F Yang, J Yang - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
Bone-conducted speech is not susceptible to background noise but suffers from poor
speech quality and intelligibility due to the limited bandwidth. This paper proposes a two …

A two-stage complex network using cycle-consistent generative adversarial networks for speech enhancement

G Yu, Y Wang, H Wang, Q Zhang, C Zheng - Speech Communication, 2021 - Elsevier
Cycle-consistent generative adversarial networks (CycleGAN) have shown their promising
performance for speech enhancement (SE), while one intractable shortcoming of these …

Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech

SW Fu, KH Hung, Y Tsao, YCF Wang - arXiv preprint arXiv:2402.16321, 2024 - arxiv.org
Speech quality estimation has recently undergone a paradigm shift from human-hearing
expert designs to machine-learning models. However, current models rely mainly on …

First coarse, fine afterward: A lightweight two-stage complex approach for monaural speech enhancement

F Dang, H Chen, Q Hu, P Zhang, Y Yan - Speech Communication, 2023 - Elsevier
Deep neural network-based speech enhancement systems have achieved promising
results. However, the state-of-the-art (SOTA) models usually have too many parameters and …

LA-VocE: Low-SNR audio-visual speech enhancement using neural vocoders

R Mira, B Xu, J Donley, A Kumar… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Audio-visual speech enhancement aims to extract clean speech from a noisy environment
by leveraging not only the audio itself but also the target speaker's lip movements. This …

Speech Enhancement With Integration of Neural Homomorphic Synthesis and Spectral Masking

W Jiang, K Yu - IEEE/ACM Transactions on Audio, Speech, and …, 2023 - ieeexplore.ieee.org
Speech enhancement refers to suppressing the background noise to improve the perceptual
quality and intelligibility of the observed noisy speech. Recently, speech enhancement …