[PDF][PDF] Hierarchical Phone Recognition with Compositional Phonetics.
There is growing interest in building phone recognition systems for low-resource languages
as the majority of languages do not have any writing systems. Phone recognition systems …
as the majority of languages do not have any writing systems. Phone recognition systems …
Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes
This paper proposes Allophant, a multilingual phoneme recognizer. It requires only a
phoneme inventory for cross-lingual transfer to a target language, allowing for low-resource …
phoneme inventory for cross-lingual transfer to a target language, allowing for low-resource …
The taste of IPA: Towards open-vocabulary keyword spotting and forced alignment in any language
In this project, we demonstrate that phoneme-based models for speech processing can
achieve strong crosslinguistic generalizability to unseen languages. We curated the …
achieve strong crosslinguistic generalizability to unseen languages. We curated the …
Hierarchical softmax for end-to-end low-resource multilingual speech recognition
Low-resource speech recognition has been long-suffering from insufficient training data. In
this paper, we propose an approach that leverages neighboring languages to improve low …
this paper, we propose an approach that leverages neighboring languages to improve low …
Phone inventories and recognition for every language
Identifying phone inventories is a crucial component in language documentation and the
preservation of endangered languages. However, even the largest collection of phone …
preservation of endangered languages. However, even the largest collection of phone …
Efficiently Identifying Low-Quality Language Subsets in Multilingual Datasets: A Case Study on a Large-Scale Multilingual Audio Dataset
Curating datasets that span multiple languages is challenging. To make the collection more
scalable, researchers often incorporate one or more imperfect classifiers in the process, like …
scalable, researchers often incorporate one or more imperfect classifiers in the process, like …
Open-vocabulary keyword spotting in any language through multilingual contrastive speech-phoneme pretraining
In this paper, we introduce a massively multilingual speech corpora with fine-grained
phonemic transcriptions, encompassing more than 115 languages from diverse language …
phonemic transcriptions, encompassing more than 115 languages from diverse language …
[PDF][PDF] Low-Resource Speech Recognition for Thousands of Languages
X Li - 2023 - kilthub.cmu.edu
Recently, the performance of speech recognition has witnessed rapid improvement due to
modern architectures. Those models typically require thousands of hours of training data for …
modern architectures. Those models typically require thousands of hours of training data for …
Tusom2021: A phonetically transcribed speech dataset from an endangered language for universal phone recognition experiments
DR Mortensen, J Picone, X Li, K Siminyu - arXiv preprint arXiv:2104.00824, 2021 - arxiv.org
There is growing interest in ASR systems that can recognize phones in a language-
independent fashion. There is additionally interest in building language technologies for low …
independent fashion. There is additionally interest in building language technologies for low …
Developing Nigeria Multilingual Languages Speech Datasets for Antenatal Orientation
SA Ajagbe - International Conference on Applied Informatics, 2024 - Springer
Nigerian native languages can still be classified as under-resourced concerning the text and
speech resources required for technology development. This limitation strikes all human …
speech resources required for technology development. This limitation strikes all human …