Adaptation algorithms for neural network-based speech recognition: An overview

P Bell, J Fainberg, O Klejch, J Li… - IEEE Open Journal …, 2020 - ieeexplore.ieee.org
We present a structured overview of adaptation algorithms for neural network-based speech
recognition, considering both hybrid hidden Markov model/neural network systems and end …

[PDF][PDF] 深度学习在手写汉字识别中的应用综述

金连文, 钟卓耀, 杨钊, 杨维信, 谢泽澄, 孙俊 - 自动化学报, 2016 - aas.net.cn
摘要手写汉字识别(Handwritten Chinese character recognition, HCCR)
是模式识别的一个重要研究领域, 最近几十年来得到了广泛的研究与关注 …

Multi-domain neural network language generation for spoken dialogue systems

TH Wen, M Gasic, N Mrksic… - arXiv preprint arXiv …, 2016 - arxiv.org
Moving from limited-domain natural language generation (NLG) to open domain is difficult
because the number of semantic input combinations grows exponentially with the number of …

Adapting and controlling DNN-based speech synthesis using input codes

HT Luong, S Takaki, GE Henter… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Methods for adapting and controlling the characteristics of output speech are important
topics in speech synthesis. In this work, we investigated the performance of DNN-based text …

Factorized neural transducer for efficient language model adaptation

X Chen, Z Meng, S Parthasarathy… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
In recent years, end-to-end (E2E) based automatic speech recognition (ASR) systems have
achieved great success due to their simplicity and promising performance. Neural …

CUED-RNNLM—An open-source toolkit for efficient training and evaluation of recurrent neural network language models

X Chen, X Liu, Y Qian, MJF Gales… - … on acoustics, speech …, 2016 - ieeexplore.ieee.org
In recent years, recurrent neural network language models (RNNLMs) have become
increasingly popular for a range of applications including speech recognition. However, the …

Innovative BERT-based reranking language models for speech recognition

SH Chiu, B Chen - 2021 IEEE Spoken Language Technology …, 2021 - ieeexplore.ieee.org
More recently, Bidirectional Encoder Representations from Transformers (BERT) was
proposed and has achieved impressive success on many natural language processing …

Transformer language models with LSTM-based cross-utterance information representation

G Sun, C Zhang, PC Woodland - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
The effective incorporation of cross-utterance information has the potential to improve
language models (LMs) for automatic speech recognition (ASR). To extract more powerful …

[PDF][PDF] Recurrent neural network language model adaptation for conversational speech recognition.

K Li, H Xu, Y Wang, D Povey, S Khudanpur - Interspeech, 2018 - danielpovey.com
We propose two adaptation models for recurrent neural network language models
(RNNLMs) to capture topic effects and longdistance triggers for conversational automatic …

Safety first: conversational agents for health care

T Bickmore, H Trinh, R Asadi, S Olafsson - Studies in conversational UX …, 2018 - Springer
Automated dialogue systems represent a promising approach for health care promotion,
thanks to their ability to emulate the experience of face-to-face interactions between health …