Adaptation algorithms for neural network-based speech recognition: An overview
We present a structured overview of adaptation algorithms for neural network-based speech
recognition, considering both hybrid hidden Markov model/neural network systems and end …
recognition, considering both hybrid hidden Markov model/neural network systems and end …
[PDF][PDF] 深度学习在手写汉字识别中的应用综述
金连文, 钟卓耀, 杨钊, 杨维信, 谢泽澄, 孙俊 - 自动化学报, 2016 - aas.net.cn
摘要手写汉字识别(Handwritten Chinese character recognition, HCCR)
是模式识别的一个重要研究领域, 最近几十年来得到了广泛的研究与关注 …
是模式识别的一个重要研究领域, 最近几十年来得到了广泛的研究与关注 …
Multi-domain neural network language generation for spoken dialogue systems
Moving from limited-domain natural language generation (NLG) to open domain is difficult
because the number of semantic input combinations grows exponentially with the number of …
because the number of semantic input combinations grows exponentially with the number of …
Adapting and controlling DNN-based speech synthesis using input codes
Methods for adapting and controlling the characteristics of output speech are important
topics in speech synthesis. In this work, we investigated the performance of DNN-based text …
topics in speech synthesis. In this work, we investigated the performance of DNN-based text …
Factorized neural transducer for efficient language model adaptation
In recent years, end-to-end (E2E) based automatic speech recognition (ASR) systems have
achieved great success due to their simplicity and promising performance. Neural …
achieved great success due to their simplicity and promising performance. Neural …
CUED-RNNLM—An open-source toolkit for efficient training and evaluation of recurrent neural network language models
In recent years, recurrent neural network language models (RNNLMs) have become
increasingly popular for a range of applications including speech recognition. However, the …
increasingly popular for a range of applications including speech recognition. However, the …
Innovative BERT-based reranking language models for speech recognition
More recently, Bidirectional Encoder Representations from Transformers (BERT) was
proposed and has achieved impressive success on many natural language processing …
proposed and has achieved impressive success on many natural language processing …
Transformer language models with LSTM-based cross-utterance information representation
The effective incorporation of cross-utterance information has the potential to improve
language models (LMs) for automatic speech recognition (ASR). To extract more powerful …
language models (LMs) for automatic speech recognition (ASR). To extract more powerful …
[PDF][PDF] Recurrent neural network language model adaptation for conversational speech recognition.
We propose two adaptation models for recurrent neural network language models
(RNNLMs) to capture topic effects and longdistance triggers for conversational automatic …
(RNNLMs) to capture topic effects and longdistance triggers for conversational automatic …
Safety first: conversational agents for health care
Automated dialogue systems represent a promising approach for health care promotion,
thanks to their ability to emulate the experience of face-to-face interactions between health …
thanks to their ability to emulate the experience of face-to-face interactions between health …