Recurrent neural networks for noise reduction in robust asr.

N Anantrasirichai, D Bull - Artificial intelligence review, 2022 - Springer

This paper reviews the current state of the art in artificial intelligence (AI) technologies and
applications in the context of the creative industries. A brief background of AI, and …

被引用次数：591 相关文章所有 10 个版本

[PDF] arxiv.org

Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org

Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

被引用次数：423 相关文章所有 10 个版本

[PDF] sciendo.com

[PDF][PDF] Performance evaluation of deep neural networks applied to speech recognition: RNN, LSTM and GRU

A Shewalkar, D Nyavanandi, SA Ludwig - Journal of Artificial …, 2019 - sciendo.com

Abstract Deep Neural Networks (DNN) are nothing but neural networks with many hidden
layers. DNNs are becoming popular in automatic speech recognition tasks which combines …

被引用次数：543 相关文章所有 9 个版本

[PDF] arxiv.org

SEGAN: Speech enhancement generative adversarial network

S Pascual, A Bonafonte, J Serra - arXiv preprint arXiv:1703.09452, 2017 - arxiv.org

Current speech enhancement techniques operate on the spectral domain and/or exploit
some higher-level feature. The majority of them tackle a limited number of noise conditions …

被引用次数：1505 相关文章所有 4 个版本

[PDF] arxiv.org

[PDF][PDF] A Critical Review of Recurrent Neural Networks for Sequence Learning

ZC Lipton - arXiv Preprint, CoRR, abs/1506.00019, 2015 - arxiv.org

Countless learning tasks require awareness of time. Image captioning, speech synthesis,
and video game playing all require that a model generate sequences of outputs. In other …

被引用次数：3525 相关文章

[PDF] arxiv.org

EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding

Y Miao, M Gowayyed, F Metze - 2015 IEEE workshop on …, 2015 - ieeexplore.ieee.org

The performance of automatic speech recognition (ASR) has improved tremendously due to
the application of deep neural networks (DNNs). Despite this progress, building a new ASR …

被引用次数：981 相关文章所有 10 个版本

[PDF] ustc.edu.cn

A regression approach to speech enhancement based on deep neural networks

Y Xu, J Du, LR Dai, CH Lee - IEEE/ACM transactions on audio …, 2014 - ieeexplore.ieee.org

In contrast to the conventional minimum mean square error (MMSE)-based noise reduction
techniques, we propose a supervised method to enhance speech by means of finding a …

被引用次数：1528 相关文章所有 8 个版本

[PDF] toronto.edu

Hybrid speech recognition with deep bidirectional LSTM

A Graves, N Jaitly, A Mohamed - 2013 IEEE workshop on …, 2013 - ieeexplore.ieee.org

Deep Bidirectional LSTM (DBLSTM) recurrent neural networks have recently been shown to
give state-of-the-art performance on the TIMIT speech database. However, the results in that …

被引用次数：2328 相关文章所有 10 个版本

[PDF] nowpublishers.com

Deep learning: methods and applications

L Deng, D Yu - Foundations and trends® in signal processing, 2014 - nowpublishers.com

This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …

被引用次数：6157 相关文章所有 13 个版本

[PDF] cmu.edu

Speech recognition with deep recurrent neural networks

A Graves, A Mohamed, G Hinton - 2013 IEEE international …, 2013 - ieeexplore.ieee.org

Recurrent neural networks (RNNs) are a powerful model for sequential data. End-to-end
training methods such as Connectionist Temporal Classification make it possible to train …

被引用次数：12007 相关文章所有 18 个版本