Self-paced curriculum learning
Curriculum learning (CL) or self-paced learning (SPL) represents a recently proposed
learning regime inspired by the learning process of humans and animals that gradually …
learning regime inspired by the learning process of humans and animals that gradually …
[PDF][PDF] Towards speaker adaptive training of deep neural network acoustic models.
We investigate the concept of speaker adaptive training (SAT) in the context of deep neural
network (DNN) acoustic models. Previous studies have shown success of performing …
network (DNN) acoustic models. Previous studies have shown success of performing …
[PDF][PDF] On speaker adaptation of long short-term memory recurrent neural networks
Abstract Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture
specializing in modeling long-range temporal dynamics. On acoustic modeling tasks, LSTM …
specializing in modeling long-range temporal dynamics. On acoustic modeling tasks, LSTM …
Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion
J Tejedor, DT Toledano, P Lopez-Otero… - EURASIP Journal on …, 2015 - Springer
Spoken term detection (STD) aims at retrieving data from a speech repository given a textual
representation of the search term. Nowadays, it is receiving much interest due to the large …
representation of the search term. Nowadays, it is receiving much interest due to the large …
The multi-domain international search on speech 2020 albayzin evaluation: Overview, systems, results, discussion and post-evaluation analyses
The large amount of information stored in audio and video repositories makes search on
speech (SoS) a challenging area that is continuously receiving much interest. Within SoS …
speech (SoS) a challenging area that is continuously receiving much interest. Within SoS …
[PDF][PDF] Distributed learning of multilingual DNN feature extractors using GPUs
Multilingual deep neural networks (DNNs) can act as deep feature extractors and have been
applied successfully to crosslanguage acoustic modeling. Learning these feature extractors …
applied successfully to crosslanguage acoustic modeling. Learning these feature extractors …
[PDF][PDF] Improving language-universal feature extraction with deep maxout and convolutional neural networks
When deployed in automated speech recognition (ASR), deep neural networks (DNNs) can
be treated as a complex feature extractor plus a simple linear classifier. Previous work has …
be treated as a complex feature extractor plus a simple linear classifier. Previous work has …
Albayzin 2016 spoken term detection evaluation: an international open competitive evaluation in spanish
J Tejedor, DT Toledano, P Lopez-Otero… - EURASIP Journal on …, 2017 - Springer
Within search-on-speech, Spoken Term Detection (STD) aims to retrieve data from a speech
repository given a textual representation of a search term. This paper presents an …
repository given a textual representation of a search term. This paper presents an …
[PDF][PDF] Graph-based re-ranking using acoustic feature similarity between search results for spoken term detection on low-resource languages
Acoustic feature similarity between search results has been shown to be very helpful for the
task of spoken term detection (STD). A graph-based re-ranking approach for STD has been …
task of spoken term detection (STD). A graph-based re-ranking approach for STD has been …
Using word burst analysis to rescore keyword search candidates on low-resource languages
J Richards, M Ma, A Rosenberg - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
For low-resource languages, keyword search (KWS) remains challenging due to the lack of
training data. This work aims to bolster KWS performance in low-resource languages by …
training data. This work aims to bolster KWS performance in low-resource languages by …