Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers

P Kenny, G Boulianne, P Ouellet… - IEEE Transactions on …, 2007 - ieeexplore.ieee.org

We present a corpus-based approach to speaker verification in which maximum-likelihood II
criteria are used to train a large-scale generative model of speaker and session variability …

Cited by 387 Related articles All 17 versions

A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition

SM Siniscalchi, CH Lee - Speech communication, 2009 - Elsevier

In this paper, a lattice rescoring approach to integrating acoustic-phonetic information into
automatic speech recognition (ASR) is described. Additional information over what is used …

Cited by 75 Related articles All 5 versions

An ensemble speaker and speaking environment modeling approach to robust speech recognition

Y Tsao, CH Lee - IEEE transactions on audio, speech, and …, 2009 - ieeexplore.ieee.org

We propose an ensemble speaker and speaking environment modeling (ESSEM) approach
to characterizing environments in order to enhance performance robustness of automatic …

Cited by 63 Related articles All 13 versions

Assamese spoken query system to access the price of agricultural commodities

S Shahnawazuddin, D Thotappa… - 2013 National …, 2013 - ieeexplore.ieee.org

In this work, a spoken query system developed for accessing the price of agricultural
commodities in Assamese language is described. The developed system enables the user …

Cited by 29 Related articles All 4 versions

Mouth gesture and voice command based robot command interface

JB Gomez, A Ceballos, F Prieto… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org

In this paper we present a voice command and mouth gesture based robot command
interface which is capable of controlling three degrees of freedom. The gesture set was …

Cited by 36 Related articles All 4 versions

Approximate test risk bound minimization through soft margin estimation

J Li, M Yuan, CH Lee - IEEE transactions on audio, speech, and …, 2007 - ieeexplore.ieee.org

Inspired by the great success of margin-based classifiers, there is a trend to incorporate the
margin concept into hidden Markov modeling for speech recognition. Several attempts …

Cited by 53 Related articles All 14 versions

[PDF] Low-memory fast on-line adaptation for acoustically mismatched children's speech recognition.

S Shahnawazuddin, R Sinha - INTERSPEECH, 2015 - isca-archive.org

This work focuses on the issues and the challenges in acoustic adaptation in context of on-line
children's speech recognition. When children's speech is decoded on adults' speech …

Cited by 15 Related articles All 5 versions

Low complexity on-line adaptation techniques in context of Assamese spoken query system

S Shahnawazuddin, KT Deepak, BD Sarma… - Journal of Signal …, 2015 - Springer

In this work, we present the development of an Assamese spoken query (SQ) system for
accessing the price of agricultural commodities. The developed system intends to make the …

Cited by 13 Related articles All 5 versions

A phonetic feature based lattice rescoring approach to LVCSR

SM Siniscalchi, T Svendsen… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org

Large Vocabulary Continuous Speech Recognition (LVCSR) systems decode the input
speech using diverse information sources, such as acoustic, lexical, and linguistic. Although …

Cited by 20 Related articles All 8 versions

Sparse coding over redundant dictionaries for fast adaptation of speech recognition system

S Shahnawazuddin, R Sinha - Computer Speech & Language, 2017 - Elsevier

This work presents a novel use of the sparse coding over redundant dictionary for fast
adaptation of the acoustic models in the hidden Markov model-based automatic speech …

Cited by 11 Related articles All 2 versions