Unsupervised speech recognition
Despite rapid progress in the recent past, current speech recognition systems still require
labeled training data which limits this technology to a small fraction of the languages spoken …
labeled training data which limits this technology to a small fraction of the languages spoken …
Pushing the limits of semi-supervised learning for automatic speech recognition
We employ a combination of recent developments in semi-supervised learning for automatic
speech recognition to obtain state-of-the-art results on LibriSpeech utilizing the unlabeled …
speech recognition to obtain state-of-the-art results on LibriSpeech utilizing the unlabeled …
[PDF][PDF] Semi-orthogonal low-rank matrix factorization for deep neural networks.
Abstract Time Delay Neural Networks (TDNNs), also known as onedimensional
Convolutional Neural Networks (1-d CNNs), are an efficient and well-performing neural …
Convolutional Neural Networks (1-d CNNs), are an efficient and well-performing neural …
TED-LIUM 3: Twice as much data and corpus repartition for experiments on speaker adaptation
In this paper, we present TED-LIUM release 3 corpus (TED-LIUM 3 is available on
https://lium. univ-lemans. fr/ted-lium3/) dedicated to speech recognition in English, which …
https://lium. univ-lemans. fr/ted-lium3/) dedicated to speech recognition in English, which …
Towards end-to-end unsupervised speech recognition
Unsupervised speech recognition has shown great potential to make Automatic Speech
Recognition (ASR) systems accessible to every language. However, existing methods still …
Recognition (ASR) systems accessible to every language. However, existing methods still …
Quantifying bias in automatic speech recognition
Automatic speech recognition (ASR) systems promise to deliver objective interpretation of
human speech. Practice and recent evidence suggests that the state-of-the-art (SotA) ASRs …
human speech. Practice and recent evidence suggests that the state-of-the-art (SotA) ASRs …
[HTML][HTML] Towards inclusive automatic speech recognition
Practice and recent evidence show that state-of-the-art (SotA) automatic speech recognition
(ASR) systems do not perform equally well for all speaker groups. Many factors can cause …
(ASR) systems do not perform equally well for all speaker groups. Many factors can cause …
A pruned rnnlm lattice-rescoring algorithm for automatic speech recognition
Lattice-rescoring is a common approach to take advantage of recurrent neural language
models in ASR, where a word-lattice is generated from 1st-pass decoding and the lattice is …
models in ASR, where a word-lattice is generated from 1st-pass decoding and the lattice is …
Speech enhancement based on teacher–student deep learning using improved speech presence probability for noise-robust speech recognition
In this paper, we propose a novel teacher-student learning framework for the preprocessing
of a speech recognizer, leveraging the online noise tracking capabilities of improved minima …
of a speech recognizer, leveraging the online noise tracking capabilities of improved minima …
Advanced rich transcription system for Estonian speech
This paper describes the current TTÜ speech transcription system for Estonian speech. The
system is designed to handle semi-spontaneous speech, such as broadcast conversations …
system is designed to handle semi-spontaneous speech, such as broadcast conversations …