Computer-assisted pronunciation training: From pronunciation scoring towards spoken language learning
This paper reviews the research approaches used in computer-assisted pronunciation
training (CAPT), addresses the existing challenges, and discusses emerging trends and …
training (CAPT), addresses the existing challenges, and discusses emerging trends and …
CNN-RNN-CTC based end-to-end mispronunciation detection and diagnosis
This paper focuses on using Convolutional Neural Network (CNN), Recurrent Neural
Network (RNN) and Connection-ist Temporal Classification (CTC) to build an end-to-end …
Network (RNN) and Connection-ist Temporal Classification (CTC) to build an end-to-end …
An end-to-end mispronunciation detection system for L2 English speech leveraging novel anti-phone modeling
Mispronunciation detection and diagnosis (MDD) is a core component of computer-assisted
pronunciation training (CAPT). Most of the existing MDD approaches focus on dealing with …
pronunciation training (CAPT). Most of the existing MDD approaches focus on dealing with …
[HTML][HTML] Large-scale characterization of non-native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL
In this work, we analyze phonetic and prosodic pronunciation patterns from iCALL, a speech
corpus designed to evaluate Mandarin mispronunciations by non-native speakers of …
corpus designed to evaluate Mandarin mispronunciations by non-native speakers of …
Improving mispronunciation detection with wav2vec2-based momentum pseudo-labeling for accentedness and intelligibility assessment
Current leading mispronunciation detection and diagnosis (MDD) systems achieve
promising performance via end-to-end phoneme recognition. One challenge of such end-to …
promising performance via end-to-end phoneme recognition. One challenge of such end-to …
Non-native children speech recognition through transfer learning
M Matassoni, R Gretter, D Falavigna… - … on Acoustics, Speech …, 2018 - ieeexplore.ieee.org
This work deals with non-native children's speech and investigates both multi-task and
transfer learning approaches to adapt a multi-language Deep Neural Network (DNN) to …
transfer learning approaches to adapt a multi-language Deep Neural Network (DNN) to …
TLT-school: a corpus of non native children speech
R Gretter, M Matassoni, S Bannò… - arXiv preprint arXiv …, 2020 - arxiv.org
This paper describes" TLT-school" a corpus of speech utterances collected in schools of
northern Italy for assessing the performance of students learning both English and German …
northern Italy for assessing the performance of students learning both English and German …
Towards robust mispronunciation detection and diagnosis for L2 English learners with accent-modulating methods
With the acceleration of globalization, more and more people are willing or required to learn
second languages (L2). One of the major remaining challenges facing current …
second languages (L2). One of the major remaining challenges facing current …
Mispronunciation detection and diagnosis using deep neural networks: a systematic review
The increased need for foreign language learning, along with advances in speech
technology have heightened interest in computer-assisted pronunciation teaching (CAPT) …
technology have heightened interest in computer-assisted pronunciation teaching (CAPT) …
Cross-lingual transfer learning of non-native acoustic modeling for pronunciation error detection and diagnosis
R Duan, T Kawahara, M Dantsuji… - IEEE/ACM Transactions …, 2019 - ieeexplore.ieee.org
In computer-assisted pronunciation training (CAPT), the scarcity of large-scale non-native
corpora and human expert annotations are two fundamental challenges to non-native …
corpora and human expert annotations are two fundamental challenges to non-native …