Method and apparatus for detecting speech endpoint using weighted finite state transducer

H Chung, S Lee, YK Lee - US Patent 9,396,722, 2016 - Google Patents
Disclosed are an apparatus and a method for detecting a speech endpoint using a WFST.
The apparatus in accordance with an embodiment of the present invention includes: a …

[PDF][PDF] Comparing open-source speech recognition toolkits

C Gaida, P Lange, R Petrick, P Proba… - 11th International …, 2014 - sinaidiagnostics.com
In this paper, a large-scale evaluation of open-source speech recognition toolkits is
described. Specifically, HTK in association with the decoders HDecode and Julius, CMU …

Method for symbolic correction in human-machine interfaces

JCP Cortés, RL Azpitarte, JRN Cerdán… - US Patent …, 2015 - Google Patents
Disclosed embodiments include methods and systems for symbolic correction in human-
machine interfaces that comprise (a) implementing a language model;(b) implementing a …

Rational recurrences

H Peng, R Schwartz, S Thomson, NA Smith - arXiv preprint arXiv …, 2018 - arxiv.org
Despite the tremendous empirical success of neural models in natural language processing,
many of them lack the strong intuitions that accompany classical machine learning …

Analysis of MLP-based hierarchical phoneme posterior probability estimator

J Pinto, S Garimella, M Magimai-Doss… - … on Audio, Speech …, 2010 - ieeexplore.ieee.org
We analyze a simple hierarchical architecture consisting of two multilayer perceptron (MLP)
classifiers in tandem to estimate the phonetic class conditional probabilities. In this …

Phone synchronous speech recognition with ctc lattices

Z Chen, Y Zhuang, Y Qian, K Yu - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org
Connectionist temporal classification (CTC) has recently shown improved performance and
efficiency in automatic speech recognition. One popular decoding implementation is to use a …

The segmentation of multi-channel meeting recordings for automatic speech recognition

J Dines, J Vepa, T Hain - 2006 - infoscience.epfl.ch
One major research challenge in the domain of the analysis of meeting room data is the
automatic transcription of what is spoken during meetings, a task which has gained …

The AMI meeting transcription system: Progress and performance

T Hain, L Burget, J Dines, G Garau, M Karafiat… - … MD, USA, May 1-4, 2006 …, 2006 - Springer
We present the AMI 2006 system for the transcription of speech in meetings. The system was
jointly developed by multiple sites on the basis of the 2005 system for participation in the …

Enhanced phone posteriors for improving speech recognition systems

H Ketabdar, H Bourlard - IEEE Transactions on Audio, Speech …, 2009 - ieeexplore.ieee.org
Using phone posterior probabilities has been increasingly explored for improving automatic
speech recognition (ASR) systems. In this paper, we propose two approaches for …

[图书][B] Finite-State Techniques

S Mihov, KU Schulz - 2019 - books.google.com
Finite-state methods are the most efficient mechanisms for analysing textual and symbolic
data, providing elegant solutions for an immense number of practical problems in …