Hierarchical Phone Recognition with Compositional Phonetics

X Li, J Li, F Metze, AW Black - Interspeech, 2021 - isca-archive.org
There is growing interest in building phone recognition systems for low-resource languages
as the majority of languages do not have any writing systems. Phone recognition systems …

Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes

K Glocker, A Herygers, M Georges - arXiv preprint arXiv:2306.04306, 2023 - arxiv.org
This paper proposes Allophant, a multilingual phoneme recognizer. It requires only a
phoneme inventory for cross-lingual transfer to a target language, allowing for low-resource …

The taste of IPA: Towards open-vocabulary keyword spotting and forced alignment in any language

J Zhu, C Yang, F Samir, J Islam - … of the 2024 Conference of the …, 2024 - aclanthology.org
In this project, we demonstrate that phoneme-based models for speech processing can
achieve strong crosslinguistic generalizability to unseen languages. We curated the …

Hierarchical softmax for end-to-end low-resource multilingual speech recognition

Q Liu, Z Gong, Z Yang, Y Yang, S Li… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Low-resource speech recognition has been long-suffering from insufficient training data. In
this paper, we propose an approach that leverages neighboring languages to improve low …

Phone inventories and recognition for every language

X Li, F Metze, DR Mortensen, AW Black… - Proceedings of the …, 2022 - aclanthology.org
Identifying phone inventories is a crucial component in language documentation and the
preservation of endangered languages. However, even the largest collection of phone …

Efficiently Identifying Low-Quality Language Subsets in Multilingual Datasets: A Case Study on a Large-Scale Multilingual Audio Dataset

F Samir, EP Ahn, S Prakash, M Soskuthy… - arXiv preprint arXiv …, 2024 - arxiv.org
Curating datasets that span multiple languages is challenging. To make the collection more
scalable, researchers often incorporate one or more imperfect classifiers in the process, like …

Open-vocabulary keyword spotting in any language through multilingual contrastive speech-phoneme pretraining

J Zhu, F Samir, C Yang, J Islam - arXiv preprint arXiv:2311.08323, 2023 - arxiv.org
In this paper, we introduce a massively multilingual speech corpora with fine-grained
phonemic transcriptions, encompassing more than 115 languages from diverse language …

Low-Resource Speech Recognition for Thousands of Languages

X Li - 2023 - kilthub.cmu.edu
Recently, the performance of speech recognition has witnessed rapid improvement due to
modern architectures. Those models typically require thousands of hours of training data for …

Tusom2021: A phonetically transcribed speech dataset from an endangered language for universal phone recognition experiments

DR Mortensen, J Picone, X Li, K Siminyu - arXiv preprint arXiv:2104.00824, 2021 - arxiv.org
There is growing interest in ASR systems that can recognize phones in a language-
independent fashion. There is additionally interest in building language technologies for low …

Developing Nigeria Multilingual Languages Speech Datasets for Antenatal Orientation

SA Ajagbe - International Conference on Applied Informatics, 2024 - Springer
Nigerian native languages can still be classified as under-resourced concerning the text and
speech resources required for technology development. This limitation strikes all human …