Diverse and expressive speech prosody prediction with denoising diffusion probabilistic model

X Li, S Liu, MWY Lam, Z Wu, C Weng… - arXiv preprint arXiv …, 2023 - arxiv.org
Expressive human speech generally abounds with rich and flexible speech prosody
variations. The speech prosody predictors in existing expressive speech synthesis methods …

BaNaVA: A cross-platform AI mobile application for preserving the Bahnaric languages

TQ Thanh, GD Lu, DN Quang… - 2023 IEEE/ACIS 8th …, 2023 - ieeexplore.ieee.org
AI-powered translation is a promising solution to the language barrier faced by the Bahnar
people. However, developing low-resource text-to-speech translation systems is …

Unlocking the Potential: an evaluation of Text-to-Speech Models for the Bahnar Language

GD Lu, TQ Thanh, DN Quang… - 2023 IEEE/ACIS 8th …, 2023 - ieeexplore.ieee.org
The paper aims at evaluating the effectiveness of an AI based mobile application of text-to-
speech models for Bahnar language. In this application, a sequential combination of two …