ASR2K: Speech recognition for around 2000 languages without audio
Most recent speech recognition models rely on large supervised datasets, which are
unavailable for many low-resource languages. In this work, we present a speech recognition …
unavailable for many low-resource languages. In this work, we present a speech recognition …
What's in the r? A review of the usage of the r symbol in the Illustrations of the IPA
R Anselme, F Pellegrino, D Dediu - Journal of the International …, 2023 - cambridge.org
What does the symbol r mean when it is used in a transcription? Here we analyze the use of
the symbols for the alveolar trills (r) and taps () among the Illustrations of the IPA since 1971 …
the symbols for the alveolar trills (r) and taps () among the Illustrations of the IPA since 1971 …
[PDF][PDF] Hierarchical Phone Recognition with Compositional Phonetics.
There is growing interest in building phone recognition systems for low-resource languages
as the majority of languages do not have any writing systems. Phone recognition systems …
as the majority of languages do not have any writing systems. Phone recognition systems …
Differentiable allophone graphs for language-universal speech recognition
Building language-universal speech recognition systems entails producing phonological
units of spoken sound that can be shared across languages. While speech annotations at …
units of spoken sound that can be shared across languages. While speech annotations at …
Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes
This paper proposes Allophant, a multilingual phoneme recognizer. It requires only a
phoneme inventory for cross-lingual transfer to a target language, allowing for low-resource …
phoneme inventory for cross-lingual transfer to a target language, allowing for low-resource …
Phone inventories and recognition for every language
Identifying phone inventories is a crucial component in language documentation and the
preservation of endangered languages. However, even the largest collection of phone …
preservation of endangered languages. However, even the largest collection of phone …
Phone based keyword spotting for transcribing very low resource languages
We investigate the efficiency of two very different spoken term detection approaches for
transcription when the available data is insufficient to train a robust speech recognition …
transcription when the available data is insufficient to train a robust speech recognition …
UniGlyph: A Seven-Segment Script for Universal Language Representation
GV Sherin, AA Euphrine, A Moreen, LA Jose - arXiv preprint arXiv …, 2024 - arxiv.org
UniGlyph is a constructed language (conlang) designed to create a universal transliteration
system using a script derived from seven-segment characters. The goal of UniGlyph is to …
system using a script derived from seven-segment characters. The goal of UniGlyph is to …
Phone distribution estimation for low resource languages
Phones are critical components in various computational linguistic fields, for example,
phone distributions could be helpful in speech recognition and speech synthesis. Traditional …
phone distributions could be helpful in speech recognition and speech synthesis. Traditional …
[PDF][PDF] Low-Resource Speech Recognition for Thousands of Languages
X Li - 2023 - kilthub.cmu.edu
Recently, the performance of speech recognition has witnessed rapid improvement due to
modern architectures. Those models typically require thousands of hours of training data for …
modern architectures. Those models typically require thousands of hours of training data for …