ASR2K: Speech recognition for around 2000 languages without audio

X Li, F Metze, DR Mortensen, AW Black… - arXiv preprint arXiv …, 2022 - arxiv.org
Most recent speech recognition models rely on large supervised datasets, which are
unavailable for many low-resource languages. In this work, we present a speech recognition …

What's in the r? A review of the usage of the r symbol in the Illustrations of the IPA

R Anselme, F Pellegrino, D Dediu - Journal of the International …, 2023 - cambridge.org
What does the symbol r mean when it is used in a transcription? Here we analyze the use of
the symbols for the alveolar trills (r) and taps () among the Illustrations of the IPA since 1971 …

[PDF][PDF] Hierarchical Phone Recognition with Compositional Phonetics.

X Li, J Li, F Metze, AW Black - Interspeech, 2021 - isca-archive.org
There is growing interest in building phone recognition systems for low-resource languages
as the majority of languages do not have any writing systems. Phone recognition systems …

Differentiable allophone graphs for language-universal speech recognition

B Yan, S Dalmia, DR Mortensen, F Metze… - arXiv preprint arXiv …, 2021 - arxiv.org
Building language-universal speech recognition systems entails producing phonological
units of spoken sound that can be shared across languages. While speech annotations at …

Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes

K Glocker, A Herygers, M Georges - arXiv preprint arXiv:2306.04306, 2023 - arxiv.org
This paper proposes Allophant, a multilingual phoneme recognizer. It requires only a
phoneme inventory for cross-lingual transfer to a target language, allowing for low-resource …

Phone inventories and recognition for every language

X Li, F Metze, DR Mortensen, AW Black… - Proceedings of the …, 2022 - aclanthology.org
Identifying phone inventories is a crucial component in language documentation and the
preservation of endangered languages. However, even the largest collection of phone …

Phone based keyword spotting for transcribing very low resource languages

É Le Ferrand, S Bird, L Besacier - 19th Workshop of the …, 2021 - researchers.cdu.edu.au
We investigate the efficiency of two very different spoken term detection approaches for
transcription when the available data is insufficient to train a robust speech recognition …

UniGlyph: A Seven-Segment Script for Universal Language Representation

GV Sherin, AA Euphrine, A Moreen, LA Jose - arXiv preprint arXiv …, 2024 - arxiv.org
UniGlyph is a constructed language (conlang) designed to create a universal transliteration
system using a script derived from seven-segment characters. The goal of UniGlyph is to …

Phone distribution estimation for low resource languages

X Li, J Li, J Yao, AW Black… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Phones are critical components in various computational linguistic fields, for example,
phone distributions could be helpful in speech recognition and speech synthesis. Traditional …

[PDF][PDF] Low-Resource Speech Recognition for Thousands of Languages

X Li - 2023 - kilthub.cmu.edu
Recently, the performance of speech recognition has witnessed rapid improvement due to
modern architectures. Those models typically require thousands of hours of training data for …