Sequence-to-sequence voice reconstruction for silent speech in a tonal language

H Li, H Lin, Y Wang, H Wang, M Zhang, H Gao, Q Ai… - Brain Sciences, 2022 - mdpi.com
Silent speech decoding (SSD), based on articulatory neuromuscular activities, has become
a prevalent task of brain–computer interfaces (BCIs) in recent years. Many works have been …

[HTML][HTML] Artificial vocal learning guided by speech recognition: What it may tell us about how children learn to speak

A Xu, DR Van Niekerk, B Gerazov, PK Krug… - Journal of …, 2024 - Elsevier
It has long been a mystery how children learn to speak without formal instructions. Previous
research has used computational modelling to help solve the mystery by simulating vocal …

Encoding of lexical tone in self-supervised models of spoken language

G Shen, M Watkins, A Alishahi, A Bisazza… - arXiv preprint arXiv …, 2024 - arxiv.org
Interpretability research has shown that self-supervised Spoken Language Models (SLMs)
encode a wide variety of features in human speech from the acoustic, phonetic …

Do Discrete Self-Supervised Representations of Speech Capture Tone Distinctions?

O Osakuade, S King - arXiv preprint arXiv:2410.19935, 2024 - arxiv.org
Discrete representations of speech, obtained from Self-Supervised Learning (SSL)
foundation models, are widely used, especially where there are limited data for the …

Tone Recognition of Pahari Language

S Asghar, U Anjum, U Akhter - Journal of Communication and …, 2023 - journals.umt.edu.pk
Pahari is an under-resourced, endangered, and undocumented tonal language, spoken in
Pakistan Administered State of the Azad Jammu and Kashmir (AJK). Preliminary studies …

[PDF][PDF] Parallel Recognition of Mandarin Tones and Focus from continuous F0

Y Chen, Y Xu - Proc. 1st International Conference on Tone and …, 2021 - researchgate.net
In tonal languages not only lexical tones but also prosodic focus can be encoded by
generating F0 contours. Such concurrent encoding of tone and intonation in speech …

Research on Musical Tone Recognition Method Based on Improved RNN for Vocal Music Teaching Network Courses

K Long - International Journal of Web-Based Learning and …, 2023 - igi-global.com
The test results show that the fast Fourier process with multiple time superposition and a
dimension length of 40 is most beneficial to the accuracy of the model. The loss curve value …

[PDF][PDF] Sequence-to-Sequence Voice Reconstruction for Silent Speech in a Tonal Language. Brain Sci. 2022, 12, 818

H Li, H Lin, Y Wang, H Wang, M Zhang, H Gao, Q Ai… - 2022 - academia.edu
Silent speech decoding (SSD), based on articulatory neuromuscular activities, has become
a prevalent task of brain–computer interfaces (BCIs) in recent years. Many works have been …

[PDF][PDF] Mandarin tone production can be learned under perceptual guidance—A machine learning simulation

H Meng, CT Chen, Y Chen, Z Liu, Y Xu - guarant.cz
Intuitively, speech production can be learned by imitating proficient speakers in language
acquisition. But a recent computational simulation has shown that learning to produce …