Artificial intelligence in the creative industries: a review

N Anantrasirichai, D Bull - Artificial intelligence review, 2022 - Springer
This paper reviews the current state of the art in artificial intelligence (AI) technologies and
applications in the context of the creative industries. A brief background of AI, and …

Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org
Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

[PDF][PDF] Performance evaluation of deep neural networks applied to speech recognition: RNN, LSTM and GRU

A Shewalkar, D Nyavanandi, SA Ludwig - Journal of Artificial …, 2019 - sciendo.com
Abstract Deep Neural Networks (DNN) are nothing but neural networks with many hidden
layers. DNNs are becoming popular in automatic speech recognition tasks which combines …

SEGAN: Speech enhancement generative adversarial network

S Pascual, A Bonafonte, J Serra - arXiv preprint arXiv:1703.09452, 2017 - arxiv.org
Current speech enhancement techniques operate on the spectral domain and/or exploit
some higher-level feature. The majority of them tackle a limited number of noise conditions …

[PDF][PDF] A Critical Review of Recurrent Neural Networks for Sequence Learning

ZC Lipton - arXiv Preprint, CoRR, abs/1506.00019, 2015 - arxiv.org
Countless learning tasks require awareness of time. Image captioning, speech synthesis,
and video game playing all require that a model generate sequences of outputs. In other …

EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding

Y Miao, M Gowayyed, F Metze - 2015 IEEE workshop on …, 2015 - ieeexplore.ieee.org
The performance of automatic speech recognition (ASR) has improved tremendously due to
the application of deep neural networks (DNNs). Despite this progress, building a new ASR …

A regression approach to speech enhancement based on deep neural networks

Y Xu, J Du, LR Dai, CH Lee - IEEE/ACM transactions on audio …, 2014 - ieeexplore.ieee.org
In contrast to the conventional minimum mean square error (MMSE)-based noise reduction
techniques, we propose a supervised method to enhance speech by means of finding a …

Hybrid speech recognition with deep bidirectional LSTM

A Graves, N Jaitly, A Mohamed - 2013 IEEE workshop on …, 2013 - ieeexplore.ieee.org
Deep Bidirectional LSTM (DBLSTM) recurrent neural networks have recently been shown to
give state-of-the-art performance on the TIMIT speech database. However, the results in that …

Deep learning: methods and applications

L Deng, D Yu - Foundations and trends® in signal processing, 2014 - nowpublishers.com
This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …

Speech recognition with deep recurrent neural networks

A Graves, A Mohamed, G Hinton - 2013 IEEE international …, 2013 - ieeexplore.ieee.org
Recurrent neural networks (RNNs) are a powerful model for sequential data. End-to-end
training methods such as Connectionist Temporal Classification make it possible to train …