Loudness predicts prominence: Fundamental frequency lends little

G Kochanski, E Grabe, J Coleman… - The Journal of the …, 2005 - pubs.aip.org
We explored a database covering seven dialects of British and Irish English and three
different styles of speech to find acoustic correlates of prominence. We built classifiers …

Mispronunciation detection and diagnosis in l2 english speech using multidistribution deep neural networks

K Li, X Qian, H Meng - IEEE/ACM Transactions on Audio …, 2016 - ieeexplore.ieee.org
This paper investigates the use of multidistribution deep neural networks (DNNs) for
mispronunciation detection and diagnosis (MDD), to circumvent the difficulties encountered …

[PDF][PDF] Duration and intensity as perceptual cues for naïve listeners' prominence and boundary perception

Y Mo - Proceedings of Speech Prosody 2008, 2008 - academia.edu
I investigate the acoustic correlates of prosodic prominence and boundary, as they are
perceived by naïve listeners, in spontaneous speech from American English (Buckeye …

Automatic lexical stress and pitch accent detection for L2 English speech using multi-distribution deep neural networks

K Li, S Mao, X Li, Z Wu, H Meng - Speech Communication, 2018 - Elsevier
This paper investigates the use of multi-distribution deep neural networks (MD-DNNs) for
automatic lexical stress detection and pitch accent detection, which are useful for …

[图书][B] Automatic detection and classification of prosodic events

A Rosenberg - 2009 - search.proquest.com
Prosody, or intonation, is a critically important component of spoken communication. The
automatic extraction of prosodic information is necessary for machines to process speech …

Automatic syllable stress detection using prosodic features for pronunciation evaluation of language learners

J Tepperman, S Narayanan - Proceedings.(ICASSP'05). IEEE …, 2005 - ieeexplore.ieee.org
A robust language learning system, designed to help students practice a foreign language
along with a machine tutor, must provide meaningful feedback to users by isolating and …

Intonation classification for L2 English speech using multi-distribution deep neural networks

K Li, X Wu, H Meng - Computer Speech & Language, 2017 - Elsevier
This paper investigates the use of multi-distribution deep neural networks (MD-DNNs) for
automatic intonation classification in second-language (L2) English speech. If a classified …

Predicting children's perceived reading proficiency with prosody modeling

K Sabu, P Rao - Computer Speech & Language, 2024 - Elsevier
Reading is a foundational skill and the focus of school-level education efforts across
countries. The assessment of linguistic competence from oral reading has long been the …

Multi-task deep learning for user intention understanding in speech interaction systems

Y Ning, J Jia, Z Wu, R Li, Y An, Y Wang… - Proceedings of the AAAI …, 2017 - ojs.aaai.org
Speech interaction systems have been gaining popularity in recent years. The main purpose
of these systems is to generate more satisfactory responses according to users' speech …

[PDF][PDF] Lexical stress detection for L2 English speech using deep belief networks.

K Li, X Qian, S Kang, H Meng - Interspeech, 2013 - isca-archive.org
This paper investigates lexical stress detection for L2 English speech using Deep Belief
Networks (DBNs). The features of the DBN used in this work include the syllable-based …