Juicer: A weighted finite-state transducer speech decoder

H Chung, S Lee, YK Lee - US Patent 9,396,722, 2016 - Google Patents

Disclosed are an apparatus and a method for detecting a speech endpoint using a WFST.
The apparatus in accordance with an embodiment of the present invention includes: a …

Cited by 164 - Related articles - All 4 versions

[PDF] sinaidiagnostics.com

[PDF] Comparing open-source speech recognition toolkits

C Gaida, P Lange, R Petrick, P Proba… - 11th International …, 2014 - sinaidiagnostics.com

In this paper, a large-scale evaluation of open-source speech recognition toolkits is
described. Specifically, HTK in association with the decoders HDecode and Julius, CMU …

Cited by 116 - Related articles - All 20 versions

[PDF] googleapis.com

Method for symbolic correction in human-machine interfaces

JCP Cortés, RL Azpitarte, JRN Cerdán… - US Patent …, 2015 - Google Patents

Disclosed embodiments include methods and systems for symbolic correction in human-machine
interfaces that comprise (a) implementing a language model; (b) implementing a …

Cited by 104 - Related articles - All 4 versions

[PDF] arxiv.org

Rational recurrences

H Peng, R Schwartz, S Thomson, NA Smith - arXiv preprint arXiv …, 2018 - arxiv.org

Despite the tremendous empirical success of neural models in natural language processing,
many of them lack the strong intuitions that accompany classical machine learning …

Cited by 47 - Related articles - All 7 versions

[PDF] researchgate.net

Analysis of MLP-based hierarchical phoneme posterior probability estimator

J Pinto, S Garimella, M Magimai-Doss… - … on Audio, Speech …, 2010 - ieeexplore.ieee.org

We analyze a simple hierarchical architecture consisting of two multilayer perceptron (MLP)
classifiers in tandem to estimate the phonetic class conditional probabilities. In this …

Cited by 103 - Related articles - All 15 versions

Phone synchronous speech recognition with CTC lattices

Z Chen, Y Zhuang, Y Qian, K Yu - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org

Connectionist temporal classification (CTC) has recently shown improved performance and
efficiency in automatic speech recognition. One popular decoding implementation is to use a …

Cited by 43 - Related articles - All 2 versions

[PDF] epfl.ch

The segmentation of multi-channel meeting recordings for automatic speech recognition

J Dines, J Vepa, T Hain - 2006 - infoscience.epfl.ch

One major research challenge in the domain of the analysis of meeting room data is the
automatic transcription of what is spoken during meetings, a task which has gained …

Cited by 98 - Related articles - All 19 versions

[PDF] vutbr.cz

The AMI meeting transcription system: Progress and performance

T Hain, L Burget, J Dines, G Garau, M Karafiat… - … MD, USA, May 1-4, 2006 …, 2006 - Springer

We present the AMI 2006 system for the transcription of speech in meetings. The system was
jointly developed by multiple sites on the basis of the 2005 system for participation in the …

Cited by 64 - Related articles - All 13 versions

[PDF] epfl.ch

Enhanced phone posteriors for improving speech recognition systems

H Ketabdar, H Bourlard - IEEE Transactions on Audio, Speech …, 2009 - ieeexplore.ieee.org

Using phone posterior probabilities has been increasingly explored for improving automatic
speech recognition (ASR) systems. In this paper, we propose two approaches for …

Cited by 58 - Related articles - All 16 versions

[BOOK] Finite-State Techniques

S Mihov, KU Schulz - 2019 - books.google.com

Finite-state methods are the most efficient mechanisms for analysing textual and symbolic
data, providing elegant solutions for an immense number of practical problems in …

Cited by 23 - Related articles - All 4 versions