Word error rate estimation for speech recognition: e-WER

A Ali, S Renals - Proceedings of the 56th Annual Meeting of the …, 2018 - aclanthology.org
Measuring the performance of automatic speech recognition (ASR) systems requires
manually transcribed data in order to compute the word error rate (WER), which is often time …

Error detection and accuracy estimation in automatic speech recognition using deep bidirectional recurrent neural networks

A Ogawa, T Hori - Speech Communication, 2017 - Elsevier
Recurrent neural networks (RNNs) have recently been applied as the classifiers for
sequential labeling problems. In this paper, deep bidirectional RNNs (DBRNNs) are applied …

ASR error management for improving spoken language understanding

E Simonnet, S Ghannay, N Camelin, Y Esteve… - arXiv preprint arXiv …, 2017 - arxiv.org
This paper addresses the problem of automatic speech recognition (ASR) error detection
and their use for improving spoken language understanding (SLU) systems. In this study, the …

Speaker-adapted confidence measures for ASR using deep bidirectional recurrent neural networks

MA Del-Agua, A Gimenez, A Sanchis… - … on Audio, Speech …, 2018 - ieeexplore.ieee.org
In the last years, deep bidirectional recurrent neural networks (DBRNN) and DBRNN with
long short-term memory cells (DBLSTM) have outperformed the most accurate classifiers for …

Learning morpheme representation for mongolian named entity recognition

W Wang, F Bao, G Gao - Neural Processing Letters, 2019 - Springer
Traditional approaches to Mongolian named entity recognition heavily rely on the feature
engineering. Even worse, the complex morphological structure of Mongolian words made …

Insights on neural representations for end-to-end speech recognition

A Ollerenshaw, MA Jalal, T Hain - arXiv preprint arXiv:2205.09456, 2022 - arxiv.org
End-to-end automatic speech recognition (ASR) models aim to learn a generalised speech
representation. However, there are limited tools available to understand the internal …

Automated conversation review to surface virtual assistant misunderstandings: Reducing cost and increasing privacy

I Beaver, A Mueen - Proceedings of the AAAI Conference on Artificial …, 2020 - ojs.aaai.org
With the rise of Intelligent Virtual Assistants (IVAs), there is a necessary rise in human effort
to identify conversations containing misunderstood user inputs. These conversations …

Multi-dialect Arabic broadcast speech recognition

AMAM Ali - 2018 - era.ed.ac.uk
Dialectal Arabic speech research suffers from the lack of labelled resources and
standardised orthography. There are three main challenges in dialectal Arabic speech …

Mongolian named entity recognition with bidirectional recurrent neural networks

W Wang, F Bao, G Gao - 2016 IEEE 28th International …, 2016 - ieeexplore.ieee.org
Traditional approaches to Named Entity Recognition almost heavily rely on feature
engineering. In this paper, we introduce a kind of bidirectional recurrent neural network with …

System-independent asr error detection and classification using recurrent neural network

R Errattahi, AEL Hannani, T Hain… - Computer Speech & …, 2019 - Elsevier
This paper addresses errors in continuous Automatic Speech Recognition (ASR) in two
stages: error detection and error type classification. Unlike the majority of research in this …