Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

P Ochieng - Artificial Intelligence Review, 2023 - Springer
Deep neural networks (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …

TinyLSTMs: Efficient neural speech enhancement for hearing aids

I Fedorov, M Stamenovic, C Jensen, LC Yang… - arXiv preprint arXiv …, 2020 - arxiv.org
Modern speech enhancement algorithms achieve remarkable noise suppression by means
of large recurrent neural networks (RNNs). However, large RNNs limit practical deployment …

Improved lite audio-visual speech enhancement

SY Chuang, HM Wang, Y Tsao - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
Numerous studies have investigated the effectiveness of audio-visual multimodal learning
for speech enhancement (AVSE) tasks, seeking a solution that uses visual data as auxiliary …

Towards more efficient DNN-based speech enhancement using quantized correlation mask

S Abdullah, M Zamani, A Demosthenous - IEEE Access, 2021 - ieeexplore.ieee.org
Many studies on deep learning-based speech enhancement (SE) utilizing the computational
auditory scene analysis method typically employs the ideal binary mask or the ideal ratio …

Increasing compactness of deep learning based speech enhancement models with parameter pruning and quantization techniques

JY Wu, C Yu, SW Fu, CT Liu… - IEEE Signal …, 2019 - ieeexplore.ieee.org
The most recent studies on deep learning based speech enhancement (SE) are focused on
improving denoising performance. However, successful SE applications require striking a …

Lite audio-visual speech enhancement

SY Chuang, Y Tsao, CC Lo, HM Wang - arXiv preprint arXiv:2005.11769, 2020 - arxiv.org
Previous studies have confirmed the effectiveness of incorporating visual information into
speech enhancement (SE) systems. Despite improved denoising performance, two …

[PDF][PDF] Squeeze for sneeze: compact neural networks for cold and flu recognition

M Albes, Z Ren, B Schuller, N Cummins - 2020 - opus.bibliothek.uni-augsburg.de
In digital health applications, speech offers advantages over other physiological signals, in
that it can be easily collected, transmitted, and stored using mobile and Internet of Things …

[PDF][PDF] IA-NET: Acceleration and Compression of Speech Enhancement Using Integer-Adder Deep Neural Network.

YC Lin, YT Hsu, SW Fu, Y Tsao, TW Kuo - Interspeech, 2019 - academia.edu
Numerous compression and acceleration techniques achieved state-of-the-art results for
classification tasks in speech processing. However, the same techniques produce …

A study of joint effect on denoising techniques and visual cues to improve speech intelligibility in cochlear implant simulation

RY Tseng, TW Wang, SW Fu, CY Lee… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Speech perception is the key to verbal communication. For people with hearing loss, the
capability to recognize speech is restricted, particularly in a noisy environment or the …

MoEVC: A mixture of experts voice conversion system with sparse gating mechanism for online computation acceleration

YT Chang, YH Yang, YH Peng… - … on Chinese Spoken …, 2021 - ieeexplore.ieee.org
Owing to the recent advancements in deep learning technology, the performance of voice
conversion (VC) in terms of quality and similarity has significantly improved. However …