Investigation of acoustic units for LVCSR systems

Using different acoustic, lexical and language modeling units for ASR of an under-resourced language–Amharic

MY Tachbelie, ST Abate, L Besacier - Speech Communication, 2014 - Elsevier

State-of-the-art large vocabulary continuous speech recognition systems use mostly phone
based acoustic models (AMs) and word based lexical and language models. However …

被引用次数：65 相关文章所有 7 个版本

[PDF] cuhk.edu.hk

Language model cross adaptation for LVCSR system combination

X Liu, MJF Gales, PC Woodland - Computer Speech & Language, 2013 - Elsevier

State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often
combine outputs from multiple sub-systems that may even be developed at different sites …

被引用次数：33 相关文章所有 16 个版本

[PDF] arxiv.org

Investigation of deep neural network acoustic modelling approaches for low resource accented mandarin speech recognition

X Xie, X Sui, X Liu, L Wang - arXiv preprint arXiv:2201.09432, 2022 - arxiv.org

The Mandarin Chinese language is known to be strongly influenced by a rich set of regional
accents, while Mandarin speech with each accent is quite low resource. Hence, an important …

被引用次数：4 相关文章所有 2 个版本

[PDF] cam.ac.uk

[PDF][PDF] The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation

X Liu, F Flego, L Wang, C Zhang… - … Annual Conference of …, 2015 - mi.eng.cam.ac.uk

This paper presents the development of the 2014 Cambridge University conversational
telephone Mandarin Chinese LVCSR system for the DARPA BOLT speech translation …

被引用次数：15 相关文章所有 9 个版本

[PDF] github.io

An experimental analysis on integrating multi-stream spectro-temporal, cepstral and pitch information for mandarin speech recognition

YB Wang, SW Li, L Lee - IEEE transactions on audio, speech …, 2013 - ieeexplore.ieee.org

Gabor features have been proposed for extracting spectro-temporal modulation information
from speech signals, and have been shown to yield large improvements in recognition …

被引用次数：11 相关文章所有 6 个版本

[PDF] cam.ac.uk

Joint training methods for tandem and hybrid speech recognition systems using deep neural networks

C Zhang - 2017 - repository.cam.ac.uk

Abstract Hidden Markov models (HMMs) have been the mainstream acoustic modelling
approach for state-of-the-art automatic speech recognition (ASR) systems over the past few …

被引用次数：10 相关文章所有 5 个版本

Syllable-Based Indonesian Automatic Speech Recognition.

DH Galatang - International Journal on Electrical …, 2020 - search.ebscohost.com

The syllable-based automatic speech recognition (ASR) systems commonly perform better
than the phoneme-based ones. This paper focuses on developing an Indonesian …

被引用次数：4 相关文章所有 2 个版本

[PDF] econstor.eu

[图书][B] Danish stød and automatic speech recognition

AS Kirkedal - 2016 - econstor.eu

Stød is a prosodic feature in Danish spoken language that is able to distinguish lexemes.
This distinction can also identify word class and has the potential to improve the …

被引用次数：7 相关文章所有 5 个版本

[PDF] isca-archive.org

[PDF][PDF] Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation.

X Liu, MJF Gales, PC Woodland - INTERSPEECH, 2011 - isca-archive.org

State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often
combine outputs from multiple subsystems developed at different sites. Cross system …

被引用次数：12 相关文章所有 7 个版本

[PDF] isca-archive.org

[PDF][PDF] A Study on Using Word-Level HMMs to Improve ASR Performance over State-of-the-Art Phone-Level Acoustic Modeling for LVCSR.

IF Chen, CH Lee - INTERSPEECH, 2012 - isca-archive.org

In this paper, we propose word-level hidden Markov models (HMMs) to supplement state-of-
the-art phone-based acoustic modeling in order to enhance the performance of automatic …

被引用次数：5 相关文章所有 3 个版本