Significance of spectral cues in automatic speech segmentation for Indian language speech...

J Chen, P Lai, A Chan, V Man, CH Chan - Sustainability, 2022 - mdpi.com

Oral presentation is a popular type of assessment in undergraduate degree programs.
However, presentation delivery and grading pose considerable challenges to students and …

Cited by 8 (6 versions)

A review on speech synthesis based on machine learning

R Kumari, A Dev, A Kumar - International Conference on Artificial …, 2021 - Springer

Recently, Speech synthesis is one of the growing techniques in the research domain that
takes input as text and provides output as acoustical form. The speech synthesis system is …

Cited by 3

Towards Developing State-of-The-Art TTS Synthesisers for 13 Indian Languages with Signal Processing Aided Alignments

A Prakash, S Umesh, HA Murthy - 2023 IEEE Automatic Speech …, 2023 - ieeexplore.ieee.org

End-to-end (E2E) systems synthesise high-quality speech, but this typically requires a large
amount of data. As E2E synthesis progressed from Tacotron to FastSpeech2, it became …

Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages

S Srivastava, I Gupta, A Prakash, J Kuriakose… - arXiv preprint arXiv …, 2023 - arxiv.org

Hidden-Markov-model (HMM) based text-to-speech (HTS) offers flexibility in speaking styles
along with fast training and synthesis while being computationally less intense. HTS …

Cited by 1 (3 versions)

[PDF] Adaptive syllable segmentation algorithm for Miao speech [苗语语音音节自适应切分算法]

冯夫健，吴磊，谭棉，蔡姗，张学文，王林 - Science Technology & …, 2024 - stae.com.cn

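Abstract: Speech segmentation is a difficult and actively studied problem in basic research on Miao speech. At its core is the blurred boundary between Miao syllables and silent segments (silence, noise),
and related results are currently scarce. To address the blurred syllable segmentation boundaries in Miao speech …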

An efficient syllable-based speech segmentation model using fuzzy and threshold-based boundary detection

R Kumari, A Dev, A Kumar - International Journal of Computational …, 2022 - World Scientific

To develop a high-quality TTS system, an appropriate segmentation of continuous speech
into the syllabic units plays a vital role. The significant objective of this research work …

Cited by 2 (2 versions)

Durational and Formantshift characteristics of Telugu alveolar and bilabial nasal phonemes

VKR Maddela, B Peri - 2021 IEEE Mysore Sub Section …, 2021 - ieeexplore.ieee.org

The phonetic-acoustic characteristics of phonemes of a stress-timed language like English
and those of syllable-timed Indian languages, are significantly different. Even within the …

Cited by 3

E-TTS: Expressive Text-to-Speech Synthesis for Hindi Using Data Augmentation

I Gupta, HA Murthy - International Conference on Speech and Computer, 2023 - Springer

Current state-of-the-art text-to-speech (TTS) systems trained on read-speech have reduced
issues with repetition or skipping of words and can produce natural-sounding speech …

[PDF] Chinese syllable segmentation algorithm based on the logarithmic envelope [基于对数包络的汉语音节切分算法]

唐维康，邵玉斌，龙华 - Laser & Optoelectronics Progress, 2022 - researching.cn

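Abstract: To improve the performance of current continuous Chinese syllable segmentation algorithms in noisy environments, a syllable segmentation algorithm is proposed based on the logarithmic envelope feature of Chinese speech.
The signal envelope is obtained by curve interpolation, and the logarithmic time-domain envelope is then obtained through filtering and a logarithm operation …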

Speech waveform reconstruction from speech parameters for an effective text to speech synthesis system using minimum phase harmonic sinusoidal model for …

N Kaur, P Singh - Multimedia Tools and Applications, 2022 - Springer

Speech processing plays a vital role in current speech communication applications. The
major objective of digital speech is transmission of messages among human and computer …

Cited by 2 (4 versions)