Artificial intelligence in the creative industries: a review
N Anantrasirichai, D Bull - Artificial intelligence review, 2022 - Springer
This paper reviews the current state of the art in artificial intelligence (AI) technologies and
applications in the context of the creative industries. A brief background of AI, and …
applications in the context of the creative industries. A brief background of AI, and …
Deep learning for environmentally robust speech recognition: An overview of recent developments
Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …
research topic for automatic speech recognition but still remains an important challenge …
[PDF][PDF] Performance evaluation of deep neural networks applied to speech recognition: RNN, LSTM and GRU
A Shewalkar, D Nyavanandi, SA Ludwig - Journal of Artificial …, 2019 - sciendo.com
Abstract Deep Neural Networks (DNN) are nothing but neural networks with many hidden
layers. DNNs are becoming popular in automatic speech recognition tasks which combines …
layers. DNNs are becoming popular in automatic speech recognition tasks which combines …
SEGAN: Speech enhancement generative adversarial network
Current speech enhancement techniques operate on the spectral domain and/or exploit
some higher-level feature. The majority of them tackle a limited number of noise conditions …
some higher-level feature. The majority of them tackle a limited number of noise conditions …
[PDF][PDF] A Critical Review of Recurrent Neural Networks for Sequence Learning
ZC Lipton - arXiv Preprint, CoRR, abs/1506.00019, 2015 - arxiv.org
Countless learning tasks require awareness of time. Image captioning, speech synthesis,
and video game playing all require that a model generate sequences of outputs. In other …
and video game playing all require that a model generate sequences of outputs. In other …
EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding
The performance of automatic speech recognition (ASR) has improved tremendously due to
the application of deep neural networks (DNNs). Despite this progress, building a new ASR …
the application of deep neural networks (DNNs). Despite this progress, building a new ASR …
A regression approach to speech enhancement based on deep neural networks
In contrast to the conventional minimum mean square error (MMSE)-based noise reduction
techniques, we propose a supervised method to enhance speech by means of finding a …
techniques, we propose a supervised method to enhance speech by means of finding a …
Hybrid speech recognition with deep bidirectional LSTM
Deep Bidirectional LSTM (DBLSTM) recurrent neural networks have recently been shown to
give state-of-the-art performance on the TIMIT speech database. However, the results in that …
give state-of-the-art performance on the TIMIT speech database. However, the results in that …
Deep learning: methods and applications
This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …
applications to a variety of signal and information processing tasks. The application areas …
Speech recognition with deep recurrent neural networks
Recurrent neural networks (RNNs) are a powerful model for sequential data. End-to-end
training methods such as Connectionist Temporal Classification make it possible to train …
training methods such as Connectionist Temporal Classification make it possible to train …