Unsupervised word-level prosody tagging for controllable speech synthesis

Y Guo, C Du, K Yu - ICASSP 2022-2022 IEEE International …, 2022 - ieeexplore.ieee.org
Although word-level prosody modeling in neural text-to-speech (TTS) has been investigated
in recent research for diverse speech synthesis, it is still challenging to control speech …

Formal concepts and methods fostering creative thinking in digital game design

J Winter, KP Jantke - 2014 IEEE 3rd Global Conference on …, 2014 - ieeexplore.ieee.org
Serious design of digital games is aiming at impact. Human players may be affected by
experiences in virtual worlds. Game designers anticipate potential experiences …

Automatic phrase segmentation and clustering in spontaneous speech

A Beke, G Szaszák, V Váradi - 2013 IEEE 4th International …, 2013 - ieeexplore.ieee.org
The aim of this research is to segment spontaneous speech using an unsupervised learning
technique. We are especially interested from a machine perception or detection point-of …

Patterns - The key to game amusement studies

KP Jantke, O Arnold - 2014 IEEE 3rd Global Conference on …, 2014 - ieeexplore.ieee.org
Studies of digital games bridge the gap from game design and technology to the impact of
play, thus, being highly interdisciplinary by nature. The scientific discourse is difficult. Pattern …

Toward Exploring the Role of Disfluencies from an Acoustic Point of View: A New Aspect of (Dis)continuous Speech Prosody Modelling

G Szaszák, A Beke - Text, Speech, and Dialogue: 18th International …, 2015 - Springer
Several studies use idealized, fluent utterances to comprehend spoken language.
Disfluencies are often regarded to be just a noise in the speech flow. Other works argue that …

Analyzing f0 discontinuity for speech prosody enhancement

G Szaszák, MG Tulics, MA Tündik - Acta Univ. Sapientiae Elect …, 2014 - researchgate.net
This letter is interested in assessing the pros and cons of using an overall continuous
versus a disrupted, not overall defined F0 estimate and compare formal and informal speech …

Evaluation of Energy and Duration on Malay Phrase Breaks

HM Hanum, ZA Bakar - 2015 9th Asia Modelling Symposium …, 2015 - ieeexplore.ieee.org
This paper presents evaluation of energy and duration features in detection of phrase
breaks. The training feature set is developed from evaluation of targeted phrase break in …

Prosodic breaks on Malay speech corpus: Evaluation of pitch, intensity and duration

HM Hanum, S Nasaruddin… - 2016 Third International …, 2016 - ieeexplore.ieee.org
Prosodic phrasing is useful to segment lengthy spontaneous speech into smaller meaningful
utterance without analysis of linguistic information. A simpler approach is presented to …

Sentence segmentation and phrase strength estimation in Malay continuous speech

HM Hanum, ZA Bakar - … of the International Conference on Speech …, 2016 - isca-archive.org
Continuous speech sentences are delivered in several shorter phrasing segments which
can be considered as units of information. The paper proposes a technique to improve …

Detection of Malay phrase breaks using energy and duration

HM Hanum, ZA Bakar - International Journal of Simulation …, 2016 - researchgate.net
A simpler approach to identify and classify phrase breaks in prosodic phrasing using energy
patterns and duration is useful in speech segmentation. Prosodic phrasing is useful to …