Diverse and expressive speech prosody prediction with denoising diffusion probabilistic model
Expressive human speech generally abounds with rich and flexible speech prosody
variations. The speech prosody predictors in existing expressive speech synthesis methods …
variations. The speech prosody predictors in existing expressive speech synthesis methods …
BaNaVA: A cross-platform AI mobile application for preserving the Bahnaric languages
TQ Thanh, GD Lu, DN Quang… - 2023 IEEE/ACIS 8th …, 2023 - ieeexplore.ieee.org
AI-powered translation is a promising solution to the language barrier faced by the Bahnar
people. However, developing low-resource text-to-speech translation systems is …
people. However, developing low-resource text-to-speech translation systems is …
Unlocking the Potential: an evaluation of Text-to-Speech Models for the Bahnar Language
GD Lu, TQ Thanh, DN Quang… - 2023 IEEE/ACIS 8th …, 2023 - ieeexplore.ieee.org
The paper aims at evaluating the effectiveness of an AI based mobile application of text-to-
speech models for Bahnar language. In this application, a sequential combination of two …
speech models for Bahnar language. In this application, a sequential combination of two …