Fast adaptation of deep neural network based on discriminant codes for speech recognition

S Xue, O Abdel-Hamid, H Jiang… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org
Fast adaptation of deep neural networks (DNN) is an important research topic in deep
learning. In this paper, we have proposed a general adaptation scheme for DNN based on …

Using neural network front-ends on far field multiple microphones based speech recognition

Y Liu, P Zhang, T Hain - 2014 IEEE international conference on …, 2014 - ieeexplore.ieee.org
This paper presents an investigation of far field speech recognition using beamforming and
channel concatenation in the context of Deep Neural Network (DNN) based feature …

Integrating Gaussian mixtures into deep neural networks: Softmax layer with hidden variables

Z Tüske, MA Tahir, R Schlüter… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
In the hybrid approach, neural network output directly serves as hidden Markov model
(HMM) state posterior probability estimates. In contrast to this, in the tandem approach …

An auditory inspired amplitude modulation filter bank for robust feature extraction in automatic speech recognition

N Moritz, J Anemüller… - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
The human ability to classify acoustic sounds is still unmatched compared to recent methods
in machine learning. Psychoacoustic and physiological studies indicate that the auditory …

Speaker adaptation of hybrid NN/HMM model for speech recognition based on singular value decomposition

S Xue, H Jiang, L Dai, Q Liu - Journal of Signal Processing Systems, 2016 - Springer
Recently several speaker adaptation methods have been proposed for deep neural network
(DNN) in many large vocabulary continuous speech recognition (LVCSR) tasks. However …

Multilingual MRASTA features for low-resource keyword search and speech recognition systems

Z Tüske, D Nolden, R Schlüter… - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
This paper investigates the application of hierarchical MRASTA bottleneck (BN) features for
under-resourced languages within the IARPA Babel project. Through multilingual training of …

Speaker adaptive joint training of gaussian mixture models and bottleneck features

Z Tüske, P Golik, R Schlüter… - 2015 IEEE Workshop on …, 2015 - ieeexplore.ieee.org
In the tandem approach, the output of a neural network (NN) serves as input features to a
Gaussian mixture model (GMM) aiming to improve the emission probability estimates. As …

[PDF][PDF] Multilingual hierarchical MRASTA features for ASR.

Z Tüske, R Schlüter, H Ney - Interspeech, 2013 - www-i6.informatik.rwth-aachen.de
Abstract Recently, a multilingual Multi Layer Perceptron (MLP) training method was
introduced without having to explicitly map the phonetic units of multiple languages to a …

[PDF][PDF] Multilingual features based keyword search for very low-resource languages.

P Golik, Z Tüske, R Schlüter, H Ney - Interspeech, 2015 - academia.edu
In this paper we describe RWTH Aachen's system for keyword search (KWS) with very
limited amount of transcribed audio data available in the target language. This setting has …

[PDF][PDF] Should deep neural nets have ears? the role of auditory features in deep learning approaches.

AMC Martinez, N Moritz, BT Meyer - Interspeech, 2014 - isca-archive.org
Features inspired by the auditory system have previously demonstrated improvement in
automatic speech recognition (ASR). Similarly, the use of Deep Neural Networks (DNN) was …