Using different acoustic, lexical and language modeling units for ASR of an under-resourced language–Amharic

MY Tachbelie, ST Abate, L Besacier - Speech Communication, 2014 - Elsevier
State-of-the-art large vocabulary continuous speech recognition systems use mostly phone
based acoustic models (AMs) and word based lexical and language models. However …

Language model cross adaptation for LVCSR system combination

X Liu, MJF Gales, PC Woodland - Computer Speech & Language, 2013 - Elsevier
State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often
combine outputs from multiple sub-systems that may even be developed at different sites …

Investigation of deep neural network acoustic modelling approaches for low resource accented mandarin speech recognition

X Xie, X Sui, X Liu, L Wang - arXiv preprint arXiv:2201.09432, 2022 - arxiv.org
The Mandarin Chinese language is known to be strongly influenced by a rich set of regional
accents, while Mandarin speech with each accent is quite low resource. Hence, an important …

[PDF][PDF] The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation

X Liu, F Flego, L Wang, C Zhang… - … Annual Conference of …, 2015 - mi.eng.cam.ac.uk
This paper presents the development of the 2014 Cambridge University conversational
telephone Mandarin Chinese LVCSR system for the DARPA BOLT speech translation …

An experimental analysis on integrating multi-stream spectro-temporal, cepstral and pitch information for mandarin speech recognition

YB Wang, SW Li, L Lee - IEEE transactions on audio, speech …, 2013 - ieeexplore.ieee.org
Gabor features have been proposed for extracting spectro-temporal modulation information
from speech signals, and have been shown to yield large improvements in recognition …

Joint training methods for tandem and hybrid speech recognition systems using deep neural networks

C Zhang - 2017 - repository.cam.ac.uk
Abstract Hidden Markov models (HMMs) have been the mainstream acoustic modelling
approach for state-of-the-art automatic speech recognition (ASR) systems over the past few …

Syllable-Based Indonesian Automatic Speech Recognition.

DH Galatang - International Journal on Electrical …, 2020 - search.ebscohost.com
The syllable-based automatic speech recognition (ASR) systems commonly perform better
than the phoneme-based ones. This paper focuses on developing an Indonesian …

[图书][B] Danish stød and automatic speech recognition

AS Kirkedal - 2016 - econstor.eu
Stød is a prosodic feature in Danish spoken language that is able to distinguish lexemes.
This distinction can also identify word class and has the potential to improve the …

[PDF][PDF] Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation.

X Liu, MJF Gales, PC Woodland - INTERSPEECH, 2011 - isca-archive.org
State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often
combine outputs from multiple subsystems developed at different sites. Cross system …

[PDF][PDF] A Study on Using Word-Level HMMs to Improve ASR Performance over State-of-the-Art Phone-Level Acoustic Modeling for LVCSR.

IF Chen, CH Lee - INTERSPEECH, 2012 - isca-archive.org
In this paper, we propose word-level hidden Markov models (HMMs) to supplement state-of-
the-art phone-based acoustic modeling in order to enhance the performance of automatic …