Partial matching and search space reduction for QbE-STD
MC Madhavi, HA Patil - Computer Speech & Language, 2017 - Elsevier
Query-by-Example approach of spoken content retrieval has gained much attention because
of its feasibility in the absence of speech recognition and its applicability in a multilingual …
of its feasibility in the absence of speech recognition and its applicability in a multilingual …
[PDF][PDF] Novel Pre-processing using Outlier Removal in Voice Conversion.
Voice conversion (VC) technique modifies the speech utterance spoken by a source
speaker to make it sound like a target speaker is speaking. Gaussian Mixture Model (GMM) …
speaker to make it sound like a target speaker is speaking. Gaussian Mixture Model (GMM) …
A novel approach to remove outliers for parallel voice conversion
Alignment is a key step before learning a mapping function between a source and a target
speaker's spectral features in various state-of-the-art parallel data Voice Conversion (VC) …
speaker's spectral features in various state-of-the-art parallel data Voice Conversion (VC) …
Language-resource independent speech segmentation using cues from a spectrogram image
In this paper, we use image processing techniques on the speech spectrogram to perform
speech phoneme segmentation. The proposed method relies solely on visual cues on the …
speech phoneme segmentation. The proposed method relies solely on visual cues on the …
Unsupervised phonetic segmentation of classical Arabic speech using forward and inverse characteristics of the vocal tract
Automatic segmentation of speech is about identifying boundaries of phonemes in a given
utterance. This paper presents a strategy driven by cosine distance similarity scores for …
utterance. This paper presents a strategy driven by cosine distance similarity scores for …
Analysis of features and metrics for alignment in text-dependent voice conversion
Voice Conversion (VC) is a technique that convert the perceived speaker identity from a
source speaker to a target speaker. Given a source and target speakers' parallel training …
source speaker to a target speaker. Given a source and target speakers' parallel training …
Unsupervised phoneme segmentation of continuous Arabic speech
HA Mait, N Aboutabit - International Journal of Speech Technology, 2024 - Springer
The development of a speech recognition system for the Arabic language presents a
significant challenge, mainly due to the limited availability of digital resources specific to this …
significant challenge, mainly due to the limited availability of digital resources specific to this …
[PDF][PDF] Effectiveness of Dynamic Features in INCA and Temporal Context-INCA.
Abstract Non-parallel Voice Conversion (VC) has gained significant attention since last one
decade. Obtaining corresponding speech frames from both the source and target speakers …
decade. Obtaining corresponding speech frames from both the source and target speakers …
Novel inter mixture weighted GMM posteriorgram for DNN and GAN-based voice conversion
Voice Conversion (VC) requires an alignment of the spectral features before learning the
mapping function, due to the speaking rate variations across the source and target speakers …
mapping function, due to the speaking rate variations across the source and target speakers …
On the convergence of INCA algorithm
Development of text-independent Voice Conversion (VC) has gained more research interest
for last one decade. Alignment of the source and target speakers' spectral features before …
for last one decade. Alignment of the source and target speakers' spectral features before …