Self-paced curriculum learning

L Jiang, D Meng, Q Zhao, S Shan… - Proceedings of the AAAI …, 2015 - ojs.aaai.org
Curriculum learning (CL) or self-paced learning (SPL) represents a recently proposed
learning regime inspired by the learning process of humans and animals that gradually …

[PDF][PDF] Towards speaker adaptive training of deep neural network acoustic models.

Y Miao, H Zhang, F Metze - Interspeech, 2014 - isca-archive.org
We investigate the concept of speaker adaptive training (SAT) in the context of deep neural
network (DNN) acoustic models. Previous studies have shown success of performing …

[PDF][PDF] On speaker adaptation of long short-term memory recurrent neural networks

Y Miao, F Metze - Sixteenth Annual Conference of the International …, 2015 - isca-archive.org
Abstract Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture
specializing in modeling long-range temporal dynamics. On acoustic modeling tasks, LSTM …

Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion

J Tejedor, DT Toledano, P Lopez-Otero… - EURASIP Journal on …, 2015 - Springer
Spoken term detection (STD) aims at retrieving data from a speech repository given a textual
representation of the search term. Nowadays, it is receiving much interest due to the large …

The multi-domain international search on speech 2020 albayzin evaluation: Overview, systems, results, discussion and post-evaluation analyses

J Tejedor, DT Toledano, JM Ramirez, AR Montalvo… - Applied Sciences, 2021 - mdpi.com
The large amount of information stored in audio and video repositories makes search on
speech (SoS) a challenging area that is continuously receiving much interest. Within SoS …

[PDF][PDF] Distributed learning of multilingual DNN feature extractors using GPUs

Y Miao, H Zhang, F Metze - Fifteenth Annual Conference of the …, 2014 - isca-archive.org
Multilingual deep neural networks (DNNs) can act as deep feature extractors and have been
applied successfully to crosslanguage acoustic modeling. Learning these feature extractors …

[PDF][PDF] Improving language-universal feature extraction with deep maxout and convolutional neural networks

Y Miao, F Metze - Fifteenth Annual Conference of the International …, 2014 - isca-archive.org
When deployed in automated speech recognition (ASR), deep neural networks (DNNs) can
be treated as a complex feature extractor plus a simple linear classifier. Previous work has …

Albayzin 2016 spoken term detection evaluation: an international open competitive evaluation in spanish

J Tejedor, DT Toledano, P Lopez-Otero… - EURASIP Journal on …, 2017 - Springer
Within search-on-speech, Spoken Term Detection (STD) aims to retrieve data from a speech
repository given a textual representation of a search term. This paper presents an …

[PDF][PDF] Graph-based re-ranking using acoustic feature similarity between search results for spoken term detection on low-resource languages

H Lee, Y Zhang, E Chuangsuwanich… - … Annual Conference of …, 2014 - isca-archive.org
Acoustic feature similarity between search results has been shown to be very helpful for the
task of spoken term detection (STD). A graph-based re-ranking approach for STD has been …

Using word burst analysis to rescore keyword search candidates on low-resource languages

J Richards, M Ma, A Rosenberg - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
For low-resource languages, keyword search (KWS) remains challenging due to the lack of
training data. This work aims to bolster KWS performance in low-resource languages by …