Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech

Articles citing this work (4 results)

Pam: Prompting audio-language models for audio quality assessment

S Deshmukh, D Alharthi, B Elizalde, H Gamper… - arXiv preprint arXiv …, 2024 - arxiv.org

While audio quality is a key performance metric for various audio processing tasks, including
generative modeling, its objective measurement remains a challenge. Audio-Language …

Cited by 11

DMDSpeech: Distilled Diffusion Model Surpassing The Teacher in Zero-shot Speech Synthesis via Direct Metric Optimization

YA Li, R Kumar, Z Jin - arXiv preprint arXiv:2410.11097, 2024 - arxiv.org

Diffusion models have demonstrated significant potential in speech synthesis tasks,
including text-to-speech (TTS) and voice cloning. However, their iterative denoising …

Exploring the Accuracy of Prosodic Encodings in State-of-the-Art Text-to-Speech Models

C Chan, J Kuang - Proc. SpeechProsody 2024, 2024 - isca-archive.org

Modern speech synthesis models have achieved increasingly humanlike outputs, and have
particularly been shown to be practically indistinguishable from natural speech at the phone …

Open-Source Multispeaker Text-to-Speech Model and Synthetic Speech Corpus with a Mexican Accent through a Web Spanish Dictionary

CDH Mena, JO Giraldo, IB de la Pena, A Medina… - isca-archive.org

Although European Spanish has abundant resources in the speech field, ASR
systems often struggle with Spanish of other world regions. Improving ASR accuracy can be …