A pruned rnnlm lattice-rescoring algorithm for automatic speech recognition

H Xu, T Chen, D Gao, Y Wang, K Li… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
Lattice-rescoring is a common approach to take advantage of recurrent neural language
models in ASR, where a word-lattice is generated from 1st-pass decoding and the lattice is …

Live streaming speech recognition using deep bidirectional LSTM acoustic models and interpolated language models

J Jorge, A Giménez, JA Silvestre-Cerdà… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Although Long-Short Term Memory (LSTM) networks and deep Transformers are now
extensively used in offline ASR, it is unclear how best offline systems can be adapted to …

[PDF][PDF] Real-Time One-Pass Decoder for Speech Recognition Using LSTM Language Models.

J Jorge, A Giménez, J Iranzo-Sánchez, J Civera… - Interspeech, 2019 - drive.google.com
Abstract Recurrent Neural Networks, in particular Long-Short Term Memory (LSTM)
networks, are widely used in Automatic Speech Recognition for language modelling during …

A parallelizable lattice rescoring strategy with neural language models

K Li, D Povey, S Khudanpur - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
This paper proposes a parallel computation strategy and a posterior-based lattice expansion
algorithm for efficient lattice rescoring with neural language models (LMs) for automatic …

On assessing and developing spoken'grammatical error correction'systems

Y Lu, S Bannò, M Gales - Proceedings of the 17th Workshop on …, 2022 - aclanthology.org
Spoken 'grammatical error correction'(SGEC) is an important process to provide feedback for
second language learning. Due to a lack of end-to-end training data, SGEC is often …

Lm4kg: Improving common sense knowledge graphs with language models

J Omeliyanenko, A Zehe, L Hettinger… - The Semantic Web–ISWC …, 2020 - Springer
Abstract Language Models (LMs) and Knowledge Graphs (KGs) are both active research
areas in Machine Learning and Semantic Web. While LMs have brought great …

[HTML][HTML] Streaming cascade-based speech translation leveraged by a direct segmentation model

J Iranzo-Sánchez, J Jorge, P Baquero-Arnal… - Neural Networks, 2021 - Elsevier
The cascade approach to Speech Translation (ST) is based on a pipeline that concatenates
an Automatic Speech Recognition (ASR) system followed by a Machine Translation (MT) …

Exploiting future word contexts in neural network language models for speech recognition

X Chen, X Liu, Y Wang, A Ragni… - … on Audio, Speech …, 2019 - ieeexplore.ieee.org
Language modeling is a crucial component in a wide range of applications including speech
recognition. Language models (LMs) are usually constructed by splitting a sentence into …

LSTM-based one-pass decoder for low-latency streaming

J Jorge, A Giménez, J Iranzo-Sánchez… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Current state-of-the-art models based on Long-Short Term Memory (LSTM) networks have
been extensively used in ASR to improve performance. However, using LSTMs under a …

Impact of ASR performance on spoken grammatical error detection

Y Lu, MJF Gales, KM Knill, P Manakul, L Wang… - 2019 - repository.cam.ac.uk
Computer assisted language learning (CALL) systems aidlearners to monitor their progress
by providing scoring andfeedback on language assessment tasks. Free speaking tests al …