AI-assisted enhancement of student presentation skills: Challenges and opportunities

J Chen, P Lai, A Chan, V Man, CH Chan - Sustainability, 2022 - mdpi.com
Oral presentation is a popular type of assessment in undergraduate degree programs.
However, presentation delivery and grading pose considerable challenges to students and …

A review on speech synthesis based on machine learning

R Kumari, A Dev, A Kumar - International Conference on Artificial …, 2021 - Springer
Recently, speech synthesis is one of the growing techniques in the research domain that
takes input as text and provides output as acoustical form. The speech synthesis system is …

Towards Developing State-of-The-Art TTS Synthesisers for 13 Indian Languages with Signal Processing Aided Alignments

A Prakash, S Umesh, HA Murthy - 2023 IEEE Automatic Speech …, 2023 - ieeexplore.ieee.org
End-to-end (E2E) systems synthesise high-quality speech, but this typically requires a large
amount of data. As E2E synthesis progressed from Tacotron to FastSpeech2, it became …

Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages

S Srivastava, I Gupta, A Prakash, J Kuriakose… - arXiv preprint arXiv …, 2023 - arxiv.org
Hidden-Markov-model (HMM) based text-to-speech (HTS) offers flexibility in speaking styles
along with fast training and synthesis while being computationally less intense. HTS …

Adaptive syllable segmentation algorithm for Miao speech

冯夫健, 吴磊, 谭棉, 蔡姗, 张学文, 王林 - Science Technology & …, 2024 - stae.com.cn
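Abstract: Speech segmentation is a difficult and actively studied problem in basic research on Miao speech. In essence, it is a problem of blurred boundaries between Miao speech syllables and silent segments (silence, noise),
and related research results are currently scarce. To address the blurred syllable segmentation boundaries in Miao speech …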

An efficient syllable-based speech segmentation model using fuzzy and threshold-based boundary detection

R Kumari, A Dev, A Kumar - International Journal of Computational …, 2022 - World Scientific
To develop a high-quality TTS system, an appropriate segmentation of continuous speech
into the syllabic units plays a vital role. The significant objective of this research work …

Durational and Formantshift characteristics of Telugu alveolar and bilabial nasal phonemes

VKR Maddela, B Peri - 2021 IEEE Mysore Sub Section …, 2021 - ieeexplore.ieee.org
The phonetic-acoustic characteristics of phonemes of a stress-timed language like English
and those of syllable-timed Indian languages, are significantly different. Even within the …

E-TTS: Expressive Text-to-Speech Synthesis for Hindi Using Data Augmentation

I Gupta, HA Murthy - International Conference on Speech and Computer, 2023 - Springer
Current state-of-the-art text-to-speech (TTS) systems trained on read-speech have reduced
issues with repetition or skipping of words and can produce natural-sounding speech …

Chinese syllable segmentation algorithm based on the logarithmic envelope

唐维康, 邵玉斌, 龙华 - Laser & Optoelectronics Progress, 2022 - researching.cn
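Abstract: To improve the performance of current continuous Chinese syllable segmentation algorithms in noisy environments, a syllable segmentation algorithm is proposed based on the logarithmic envelope feature of Chinese speech.
The signal envelope is obtained by curve interpolation, and the logarithmic time-domain envelope is then obtained through filtering and a logarithm operation …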

Speech waveform reconstruction from speech parameters for an effective text to speech synthesis system using minimum phase harmonic sinusoidal model for …

N Kaur, P Singh - Multimedia Tools and Applications, 2022 - Springer
Speech processing plays a vital role in current speech communication applications. The
major objective of digital speech is transmission of messages among human and computer …