Effectiveness of PLP-based phonetic segmentation for speech synthesis

MC Madhavi, HA Patil - Computer Speech & Language, 2017 - Elsevier

Query-by-Example approach of spoken content retrieval has gained much attention because
of its feasibility in the absence of speech recognition and its applicability in a multilingual …

被引用次数：20 相关文章所有 3 个版本

[PDF] researchgate.net

[PDF][PDF] Novel Pre-processing using Outlier Removal in Voice Conversion.

SV Rao, NJ Shah, HA Patil - SSW, 2016 - researchgate.net

Voice conversion (VC) technique modifies the speech utterance spoken by a source
speaker to make it sound like a target speaker is speaking. Gaussian Mixture Model (GMM) …

被引用次数：16 相关文章所有 6 个版本

A novel approach to remove outliers for parallel voice conversion

NJ Shah, HA Patil - Computer Speech & Language, 2019 - Elsevier

Alignment is a key step before learning a mapping function between a source and a target
speaker's spectral features in various state-of-the-art parallel data Voice Conversion (VC) …

被引用次数：10 相关文章所有 2 个版本

Language-resource independent speech segmentation using cues from a spectrogram image

SJ Leow, ES Chng, CH Lee - 2015 IEEE International …, 2015 - ieeexplore.ieee.org

In this paper, we use image processing techniques on the speech spectrogram to perform
speech phoneme segmentation. The proposed method relies solely on visual cues on the …

被引用次数：13 相关文章所有 2 个版本

Unsupervised phonetic segmentation of classical Arabic speech using forward and inverse characteristics of the vocal tract

M Javed, MMA Baig, SA Qazi - Arabian Journal for Science and …, 2020 - Springer

Automatic segmentation of speech is about identifying boundaries of phonemes in a given
utterance. This paper presents a strategy driven by cosine distance similarity scores for …

被引用次数：8 相关文章所有 2 个版本

Analysis of features and metrics for alignment in text-dependent voice conversion

NJ Shah, HA Patil - Pattern Recognition and Machine Intelligence: 7th …, 2017 - Springer

Voice Conversion (VC) is a technique that convert the perceived speaker identity from a
source speaker to a target speaker. Given a source and target speakers' parallel training …

被引用次数：11 相关文章

Unsupervised phoneme segmentation of continuous Arabic speech

HA Mait, N Aboutabit - International Journal of Speech Technology, 2024 - Springer

The development of a speech recognition system for the Arabic language presents a
significant challenge, mainly due to the limited availability of digital resources specific to this …

被引用次数：1 相关文章

[PDF] researchgate.net

[PDF][PDF] Effectiveness of Dynamic Features in INCA and Temporal Context-INCA.

NJ Shah, HA Patil - INTERSPEECH, 2018 - researchgate.net

Abstract Non-parallel Voice Conversion (VC) has gained significant attention since last one
decade. Obtaining corresponding speech frames from both the source and target speakers …

被引用次数：8 相关文章所有 7 个版本

[PDF] apsipa.org

Novel inter mixture weighted GMM posteriorgram for DNN and GAN-based voice conversion

NJ Shah, R Sreeraj, N Shah… - 2018 Asia-Pacific Signal …, 2018 - ieeexplore.ieee.org

Voice Conversion (VC) requires an alignment of the spectral features before learning the
mapping function, due to the speaking rate variations across the source and target speakers …

被引用次数：7 相关文章所有 3 个版本

[PDF] researchgate.net

On the convergence of INCA algorithm

NJ Shah, HA Patil - 2017 Asia-Pacific Signal and Information …, 2017 - ieeexplore.ieee.org

Development of text-independent Voice Conversion (VC) has gained more research interest
for last one decade. Alignment of the source and target speakers' spectral features before …

被引用次数：8 相关文章所有 4 个版本