Partial matching and search space reduction for QbE-STD

MC Madhavi, HA Patil - Computer Speech & Language, 2017 - Elsevier
Query-by-Example approach of spoken content retrieval has gained much attention because
of its feasibility in the absence of speech recognition and its applicability in a multilingual …

[PDF][PDF] Novel Pre-processing using Outlier Removal in Voice Conversion.

SV Rao, NJ Shah, HA Patil - SSW, 2016 - researchgate.net
Voice conversion (VC) technique modifies the speech utterance spoken by a source
speaker to make it sound like a target speaker is speaking. Gaussian Mixture Model (GMM) …

A novel approach to remove outliers for parallel voice conversion

NJ Shah, HA Patil - Computer Speech & Language, 2019 - Elsevier
Alignment is a key step before learning a mapping function between a source and a target
speaker's spectral features in various state-of-the-art parallel data Voice Conversion (VC) …

Language-resource independent speech segmentation using cues from a spectrogram image

SJ Leow, ES Chng, CH Lee - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
In this paper, we use image processing techniques on the speech spectrogram to perform
speech phoneme segmentation. The proposed method relies solely on visual cues on the …

Unsupervised phonetic segmentation of classical Arabic speech using forward and inverse characteristics of the vocal tract

M Javed, MMA Baig, SA Qazi - Arabian Journal for Science and …, 2020 - Springer
Automatic segmentation of speech is about identifying boundaries of phonemes in a given
utterance. This paper presents a strategy driven by cosine distance similarity scores for …

Analysis of features and metrics for alignment in text-dependent voice conversion

NJ Shah, HA Patil - Pattern Recognition and Machine Intelligence: 7th …, 2017 - Springer
Voice Conversion (VC) is a technique that convert the perceived speaker identity from a
source speaker to a target speaker. Given a source and target speakers' parallel training …

Unsupervised phoneme segmentation of continuous Arabic speech

HA Mait, N Aboutabit - International Journal of Speech Technology, 2024 - Springer
The development of a speech recognition system for the Arabic language presents a
significant challenge, mainly due to the limited availability of digital resources specific to this …

[PDF][PDF] Effectiveness of Dynamic Features in INCA and Temporal Context-INCA.

NJ Shah, HA Patil - INTERSPEECH, 2018 - researchgate.net
Abstract Non-parallel Voice Conversion (VC) has gained significant attention since last one
decade. Obtaining corresponding speech frames from both the source and target speakers …

Novel inter mixture weighted GMM posteriorgram for DNN and GAN-based voice conversion

NJ Shah, R Sreeraj, N Shah… - 2018 Asia-Pacific Signal …, 2018 - ieeexplore.ieee.org
Voice Conversion (VC) requires an alignment of the spectral features before learning the
mapping function, due to the speaking rate variations across the source and target speakers …

On the convergence of INCA algorithm

NJ Shah, HA Patil - 2017 Asia-Pacific Signal and Information …, 2017 - ieeexplore.ieee.org
Development of text-independent Voice Conversion (VC) has gained more research interest
for last one decade. Alignment of the source and target speakers' spectral features before …