An empirical evaluation of zero resource acoustic unit discovery

J Kahn, M Riviere, W Zheng… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

We introduce a new collection of spoken English audio suitable for training speech
recognition systems under limited or no supervision. It is derived from open-source audio …

被引用次数：662 相关文章所有 13 个版本

[PDF] a-star.edu.sg

Multilingual bottle-neck feature learning from untranscribed speech

H Chen, CC Leung, L Xie, B Ma… - 2017 IEEE Automatic …, 2017 - ieeexplore.ieee.org

We propose to learn a low-dimensional feature representation for multiple languages
without access to their manual transcription. The multilingual features are extracted from a …

被引用次数：35 相关文章所有 2 个版本

[PDF] researchgate.net

Sinhala and tamil speech intent identification from english phoneme based asr

Y Karunanayake, U Thayasivam… - … Conference on Asian …, 2019 - ieeexplore.ieee.org

Today we can find many use cases for content-based speech classification. These include
speech topic identification and spoken command recognition. Automatic Speech …

被引用次数：20 相关文章所有 2 个版本

[PDF] arxiv.org

Topic identification for speech without asr

C Liu, J Trmal, M Wiesner, C Harman… - arXiv preprint arXiv …, 2017 - arxiv.org

Modern topic identification (topic ID) systems for speech use automatic speech recognition
(ASR) to produce speech transcripts, and perform supervised classification on such ASR …

被引用次数：22 相关文章所有 6 个版本

[PDF] arxiv.org

Automatic speech recognition and topic identification for almost-zero-resource languages

M Wiesner, C Liu, L Ondel, C Harman… - arXiv preprint arXiv …, 2018 - arxiv.org

Automatic speech recognition (ASR) systems often need to be developed for extremely low-
resource languages to serve end-uses such as audio content categorization and search …

被引用次数：16 相关文章所有 10 个版本

[PDF] a-star.edu.sg

Multitask feature learning for low-resource query-by-example spoken term detection

H Chen, CC Leung, L Xie, B Ma… - IEEE Journal of Selected …, 2017 - ieeexplore.ieee.org

We propose a novel technique that learns a low-dimensional feature representation from
unlabeled data of a target language, and labeled data from a nontarget language. The …

被引用次数：15 相关文章所有 4 个版本

Automatic sub-word unit discovery and pronunciation lexicon induction for ASR with application to under-resourced languages

W Agenbag, T Niesler - Computer Speech & Language, 2019 - Elsevier

We present a method enabling the unsupervised discovery of sub-word units (SWUs) and
associated pronunciation lexicons for use in automatic speech recognition (ASR) systems …

被引用次数：8 相关文章所有 2 个版本

Bottom-up unsupervised word discovery via acoustic units

S Bhati, C Liu, J Villalba, J Trmal… - 2019 IEEE Global …, 2019 - ieeexplore.ieee.org

Unsupervised term discovery is the task of identifying and grouping reoccurring word-like
patterns from the untranscribed audio data. It facilitates unsupervised acoustic model …

被引用次数：3 相关文章

[PDF] festvox.org

[PDF][PDF] Unsupervised Phonetic and Word Level Discovery for Speech to Speech Translation for Unwritten Languages.

S Hillis, AP Kumar, AW Black - INTERSPEECH, 2019 - festvox.org

We experiment with unsupervised methods for deriving and clustering symbolic
representations of speech, working towards speech-to-speech translation for languages …

被引用次数：3 相关文章所有 7 个版本

[PDF] researchgate.net

[PDF][PDF] Leveraging text data for word segmentation for underresourced languages

T Glarner, B Boenninghoff, O Walter, R Haeb-Umbach - System, 2017 - researchgate.net

In this contribution we show how to exploit text data to support word discovery from audio
input in an underresourced target language. Given audio, of which a certain amount is …

被引用次数：4 相关文章所有 7 个版本