Speaker and session variability in GMM-based speaker verification
P Kenny, G Boulianne, P Ouellet… - IEEE Transactions on …, 2007 - ieeexplore.ieee.org
We present a corpus-based approach to speaker verification in which maximum-likelihood II
criteria are used to train a large-scale generative model of speaker and session variability …
A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition
SM Siniscalchi, CH Lee - Speech communication, 2009 - Elsevier
In this paper, a lattice rescoring approach to integrating acoustic-phonetic information into
automatic speech recognition (ASR) is described. Additional information over what is used …
An ensemble speaker and speaking environment modeling approach to robust speech recognition
We propose an ensemble speaker and speaking environment modeling (ESSEM) approach
to characterizing environments in order to enhance performance robustness of automatic …
Assamese spoken query system to access the price of agricultural commodities
S Shahnawazuddin, D Thotappa… - 2013 National …, 2013 - ieeexplore.ieee.org
In this work, a spoken query system developed for accessing the price of agricultural
commodities in Assamese language is described. The developed system enables the user …
Mouth gesture and voice command based robot command interface
In this paper we present a voice command and mouth gesture based robot command
interface which is capable of controlling three degrees of freedom. The gesture set was …
Approximate test risk bound minimization through soft margin estimation
Inspired by the great success of margin-based classifiers, there is a trend to incorporate the
margin concept into hidden Markov modeling for speech recognition. Several attempts …
Low-memory fast on-line adaptation for acoustically mismatched children's speech recognition.
S Shahnawazuddin, R Sinha - INTERSPEECH, 2015 - isca-archive.org
This work focuses on the issues and the challenges in acoustic adaptation in context of on-
line children's speech recognition. When children's speech is decoded on adults' speech …
Low complexity on-line adaptation techniques in context of Assamese spoken query system
In this work, we present the development of an Assamese spoken query (SQ) system for
accessing the price of agricultural commodities. The developed system intends to make the …
A phonetic feature based lattice rescoring approach to LVCSR
SM Siniscalchi, T Svendsen… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
Large Vocabulary Continuous Speech Recognition (LVCSR) systems decode the input
speech using diverse information sources, such as acoustic, lexical, and linguistic. Although …
Sparse coding over redundant dictionaries for fast adaptation of speech recognition system
S Shahnawazuddin, R Sinha - Computer Speech & Language, 2017 - Elsevier
This work presents a novel use of the sparse coding over redundant dictionary for fast
adaptation of the acoustic models in the hidden Markov model-based automatic speech …