Data augmentation for deep neural network acoustic modeling

X Cui, V Goel, B Kingsbury - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
This paper investigates data augmentation for deep neural network acoustic modeling
based on label-preserving transformations to deal with data sparsity. Two data …

System combination and score normalization for spoken term detection

J Mamou, J Cui, X Cui, MJF Gales… - … , Speech and Signal …, 2013 - ieeexplore.ieee.org
Spoken content in languages of emerging importance needs to be searchable to provide
access to the underlying information. In this paper, we investigate the problem of extending …

Developing speech recognition systems for corpus indexing under the IARPA Babel program

J Cui, X Cui, B Ramabhadran, J Kim… - … , Speech and Signal …, 2013 - ieeexplore.ieee.org
Automatic speech recognition is a core component of many applications, including keyword
search. In this paper we describe experiments on acoustic modeling, language modeling …

A high-performance Cantonese keyword search system

B Kingsbury, J Cui, X Cui, MJF Gales… - … , Speech and Signal …, 2013 - ieeexplore.ieee.org
We present a system for keyword search on Cantonese conversational telephony audio,
collected for the IARPA Babel program, that achieves good performance by combining …

Stochastic modelling as a tool for seismic signals segmentation

D Kucharczyk, A Wyłomańska… - Shock and …, 2016 - Wiley Online Library
In order to model nonstationary real‐world processes one can find appropriate theoretical
model with properties following the analyzed data. However in this case many trajectories of …

Ensemble learning approaches in speech recognition

Y Zhao, J Xue, X Chen - Speech and audio processing for coding …, 2014 - Springer
An overview is made on the ensemble learning efforts that have emerged in automatic
speech recognition in recent years. The approaches that are based on different machine …

[PDF][PDF] Improving deep neural network acoustic modeling for audio corpus indexing under the iarpa babel program

X Cui, B Kingsbury, J Cui… - … Annual Conference of …, 2014 - researchgate.net
This paper is focused on several techniques that improve deep neural network (DNN)
acoustic modeling for audio corpus indexing in the context of the IARPA Babel program …

[PDF][PDF] Recent improvements in neural network acoustic modeling for LVCSR in low resource languages

J Cui, B Ramabhadran, X Cui… - … Annual Conference of …, 2014 - isca-archive.org
In this paper we focus on several techniques that improve deep neural network (DNN)
acoustic modeling for low-resource languages. We explore the use of different features such …

Towards automatic cross-lingual acoustic modelling applied to HMM-based speech synthesis for under-resourced languages

T Justin, F Mihelič, J Žibert - automatika, 2016 - Taylor & Francis
Nowadays Human Computer Interaction (HCI) can also be achieved with voice user
interfaces (VUIs). To enable devices to communicate with humans by speech in the user's …

[PDF][PDF] Signal Processing Cues to Improve Automatic Speech Recognition for Low Resource Indian Languages.

A Baby, K Pandia, HA Murthy - SLTU, 2018 - isca-archive.org
Building accurate acoustic models for low resource languages is the focus of this paper.
Acoustic models are likely to be accurate provided the phone boundaries are determined …