Speaker and session variability in GMM-based speaker verification

P Kenny, G Boulianne, P Ouellet… - IEEE Transactions on …, 2007 - ieeexplore.ieee.org
We present a corpus-based approach to speaker verification in which maximum-likelihood II
criteria are used to train a large-scale generative model of speaker and session variability …

A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition

SM Siniscalchi, CH Lee - Speech communication, 2009 - Elsevier
In this paper, a lattice rescoring approach to integrating acoustic-phonetic information into
automatic speech recognition (ASR) is described. Additional information over what is used …

An ensemble speaker and speaking environment modeling approach to robust speech recognition

Y Tsao, CH Lee - IEEE transactions on audio, speech, and …, 2009 - ieeexplore.ieee.org
We propose an ensemble speaker and speaking environment modeling (ESSEM) approach
to characterizing environments in order to enhance performance robustness of automatic …

Assamese spoken query system to access the price of agricultural commodities

S Shahnawazuddin, D Thotappa… - 2013 National …, 2013 - ieeexplore.ieee.org
In this work, a spoken query system developed for accessing the price of agricultural
commodities in Assamese language is described. The developed system enables the user …

Mouth gesture and voice command based robot command interface

JB Gomez, A Ceballos, F Prieto… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
In this paper we present a voice command and mouth gesture based robot command
interface which is capable of controlling three degrees of freedom. The gesture set was …

Approximate test risk bound minimization through soft margin estimation

J Li, M Yuan, CH Lee - IEEE transactions on audio, speech, and …, 2007 - ieeexplore.ieee.org
Inspired by the great success of margin-based classifiers, there is a trend to incorporate the
margin concept into hidden Markov modeling for speech recognition. Several attempts …

[PDF][PDF] Low-memory fast on-line adaptation for acoustically mismatched children's speech recognition.

S Shahnawazuddin, R Sinha - INTERSPEECH, 2015 - isca-archive.org
This work focuses on the issues and the challenges in acoustic adaptation in context of on-
line children's speech recognition. When children's speech is decoded on adults' speech …

Low complexity on-line adaptation techniques in context of assamese spoken query system

S Shahnawazuddin, KT Deepak, BD Sarma… - Journal of Signal …, 2015 - Springer
In this work, we present the development of an Assamese spoken query (SQ) system for
accessing the price of agricultural commodities. The developed system intends to make the …

A phonetic feature based lattice rescoring approach to LVCSR

SM Siniscalchi, T Svendsen… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
Large Vocabulary Continuous Speech Recognition (LVCSR) systems decode the input
speech using diverse information sources, such as acoustic, lexical, and linguistic. Although …

Sparse coding over redundant dictionaries for fast adaptation of speech recognition system

S Shahnawazuddin, R Sinha - Computer Speech & Language, 2017 - Elsevier
This work presents a novel use of the sparse coding over redundant dictionary for fast
adaptation of the acoustic models in the hidden Markov model-based automatic speech …