Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR

M Tahon, G Lecorvé, D Lolive - IEEE Transactions on Affective …, 2018 - ieeexplore.ieee.org

In the field of expressive speech synthesis, a lot of work has been conducted on
suprasegmental prosodic features while few has been done on pronunciation variants …

被引用次数：22 相关文章所有 12 个版本

[PDF] wellformedness.com

[PDF][PDF] Discriminative pronunciation modeling for dialectal speech recognition

M Lehr, K Gorman, I Shafran - Fifteenth Annual Conference of …, 2014 - wellformedness.com

Speech recognizers are typically trained with data from a standard dialect and do not
generalize to non-standard dialects. Mismatch mainly occurs in the acoustic realization of …

被引用次数：28 相关文章所有 12 个版本

[PDF] sinica.edu.tw

Deriving disyllabic word variants from a Chinese conversational speech corpus

YF Liu, SC Tseng, JSR Jang - The Journal of the Acoustical Society of …, 2016 - pubs.aip.org

Motivated by the quasi-categorical reduced forms of disyllabic words produced in Chinese
conversational speech, a frequency-based selection procedure of typical pronunciation by …

被引用次数：13 相关文章所有 8 个版本

[PDF] hal.science

Probabilistic speaker pronunciation adaptation for spontaneous speech synthesis using linguistic features

R Qader, G Lecorvé, D Lolive, P Sébillot - Statistical Language and …, 2015 - Springer

Pronunciation adaptation consists in predicting pronunciation variants of words and
utterances based on their standard pronunciation and a target style. This is a key issue in …

被引用次数：10 相关文章所有 10 个版本

[PDF] hal.science

Improving TTS with corpus-specific pronunciation adaptation

M Tahon, R Qader, G Lecorvé, D Lolive - Interspeech, 2016 - inria.hal.science

Text-to-speech (TTS) systems are built on speech corpora which are labeled with carefully
checked and segmented phonemes. However, phoneme sequences generated by …

被引用次数：9 相关文章所有 7 个版本

[PDF] irdta.eu

Optimal feature set and minimal training size for pronunciation adaptation in TTS

M Tahon, R Qader, G Lecorvé, D Lolive - Statistical Language and Speech …, 2016 - Springer

Abstract Text-to-Speech (TTS) systems rely on a grapheme-to-phoneme converter which is
built to produce canonical, or statically stylized, pronunciations. Hence, the TTS quality …

被引用次数：5 相关文章所有 6 个版本

[PDF] limsi.fr

A knowledge-based system for stop consonant identification based on speech spectrogram reading

LF Lamel - Computer Speech & Language, 1993 - Elsevier

In order to formalize the information used in spectrogram reading, a knowledge-based
system for identifying spoken stop consonants was developed. Speech spectrogram reading …

被引用次数：9 相关文章所有 9 个版本

[PDF] hal.science

Traitement automatique de la parole expressive: retour vers des systèmes interprétables?

M Tahon - 2023 - hal.science

La parole est un moyen de communication fondamental qui s' inscrit dans une interaction
entre le locuteur et ses auditeurs. En plus du contenu sémantique, le signal de parole nous …

被引用次数：1 相关文章所有 4 个版本

[PDF] hal.science

Statistical pronunciation adaptation for spontaneous speech synthesis

R Qader, G Lecorvé, D Lolive, M Tahon… - Text, Speech, and …, 2017 - Springer

To bring more expressiveness into text-to-speech systems, this paper presents a new
pronunciation variant generation method which works by adapting standard, ie, dictionary …

被引用次数：4 相关文章所有 6 个版本

[PDF] aclanthology.org

Adaptation de la prononciation pour la synthèse de la parole spontanée en utilisant des informations linguistiques (Pronunciation adaptation for spontaneous speech …

R Qader, G Lecorvé, D Lolive… - Actes de la conférence …, 2016 - aclanthology.org

Cet article présente une nouvelle méthode d'adaptation de la prononciation dont le but est
de reproduire le style spontané. Il s' agit d'une tâche-clé en synthèse de la parole car elle …

被引用次数：2 相关文章所有 8 个版本