Sequence-to-sequence voice reconstruction for silent speech in a tonal language
H Li, H Lin, Y Wang, H Wang, M Zhang, H Gao, Q Ai… - Brain Sciences, 2022 - mdpi.com
Silent speech decoding (SSD), based on articulatory neuromuscular activities, has become
a prevalent task of brain–computer interfaces (BCIs) in recent years. Many works have been …
a prevalent task of brain–computer interfaces (BCIs) in recent years. Many works have been …
[HTML][HTML] Artificial vocal learning guided by speech recognition: What it may tell us about how children learn to speak
It has long been a mystery how children learn to speak without formal instructions. Previous
research has used computational modelling to help solve the mystery by simulating vocal …
research has used computational modelling to help solve the mystery by simulating vocal …
Encoding of lexical tone in self-supervised models of spoken language
Interpretability research has shown that self-supervised Spoken Language Models (SLMs)
encode a wide variety of features in human speech from the acoustic, phonetic …
encode a wide variety of features in human speech from the acoustic, phonetic …
Do Discrete Self-Supervised Representations of Speech Capture Tone Distinctions?
O Osakuade, S King - arXiv preprint arXiv:2410.19935, 2024 - arxiv.org
Discrete representations of speech, obtained from Self-Supervised Learning (SSL)
foundation models, are widely used, especially where there are limited data for the …
foundation models, are widely used, especially where there are limited data for the …
Tone Recognition of Pahari Language
S Asghar, U Anjum, U Akhter - Journal of Communication and …, 2023 - journals.umt.edu.pk
Pahari is an under-resourced, endangered, and undocumented tonal language, spoken in
Pakistan Administered State of the Azad Jammu and Kashmir (AJK). Preliminary studies …
Pakistan Administered State of the Azad Jammu and Kashmir (AJK). Preliminary studies …
[PDF][PDF] Parallel Recognition of Mandarin Tones and Focus from continuous F0
Y Chen, Y Xu - Proc. 1st International Conference on Tone and …, 2021 - researchgate.net
In tonal languages not only lexical tones but also prosodic focus can be encoded by
generating F0 contours. Such concurrent encoding of tone and intonation in speech …
generating F0 contours. Such concurrent encoding of tone and intonation in speech …
Research on Musical Tone Recognition Method Based on Improved RNN for Vocal Music Teaching Network Courses
K Long - International Journal of Web-Based Learning and …, 2023 - igi-global.com
The test results show that the fast Fourier process with multiple time superposition and a
dimension length of 40 is most beneficial to the accuracy of the model. The loss curve value …
dimension length of 40 is most beneficial to the accuracy of the model. The loss curve value …
[PDF][PDF] Sequence-to-Sequence Voice Reconstruction for Silent Speech in a Tonal Language. Brain Sci. 2022, 12, 818
H Li, H Lin, Y Wang, H Wang, M Zhang, H Gao, Q Ai… - 2022 - academia.edu
Silent speech decoding (SSD), based on articulatory neuromuscular activities, has become
a prevalent task of brain–computer interfaces (BCIs) in recent years. Many works have been …
a prevalent task of brain–computer interfaces (BCIs) in recent years. Many works have been …
[PDF][PDF] Mandarin tone production can be learned under perceptual guidance—A machine learning simulation
H Meng, CT Chen, Y Chen, Z Liu, Y Xu - guarant.cz
Intuitively, speech production can be learned by imitating proficient speakers in language
acquisition. But a recent computational simulation has shown that learning to produce …
acquisition. But a recent computational simulation has shown that learning to produce …