Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity …

H Kawahara, M Morise, T Takahashi… - … on acoustics, speech …, 2008 - ieeexplore.ieee.org
A simple new method for estimating temporally stable power spectra is introduced to provide
a unified basis for computing an interference-free spectrum, the fundamental frequency (F0) …

Technical foundations of TANDEM-STRAIGHT, a speech analysis, modification and synthesis framework

H Kawahara, M Morise - Sadhana, 2011 - Springer
This article presents comprehensive technical information about STRAIGHT and TANDEM-
STRAIGHT, a widely used speech modification tool and its successor. They share the same …

Development of exploratory research tools based on TANDEM-STRAIGHT

H Kawahara, T Takahashi… - … : APSIPA ASC 2009 …, 2009 - eprints.lib.hokudai.ac.jp
This article introduces a new set of tools based on TANDEM-STRAIGHT, a fundamental
reformulation of STRAIGHT, a speech analysis, modification and resynthesis system …

Temporally variable multi-aspect N-way morphing based on interference-free speech representations

H Kawahara, M Morise, H Banno… - 2013 Asia-Pacific Signal …, 2013 - ieeexplore.ieee.org
Voice morphing is a powerful tool for exploratory research and various applications. A
temporally variable multi-aspect morphing is extended to enable morphing of arbitrarily …

An interference-free representation of instantaneous frequency of periodic signals and its application to F0 extraction

H Kawahara, T Irino, M Morise - 2011 IEEE International …, 2011 - ieeexplore.ieee.org
An interference-free representation of the instantaneous frequency of constituent harmonic
components of periodic signals is introduced. The power weighted average instantaneous …

Simplification and extension of non-periodic excitation source representations for high-quality speech manipulation systems.

H Kawahara, M Morise, T Takahashi, H Banno… - Interspeech, 2010 - Citeseer
A systematic framework for non-periodic excitation source representation is proposed for
high-quality speech manipulation systems such as TANDEM-STRAIGHT, which is basically …

Higher order waveform symmetry measure and its application to periodicity detectors for speech and singing with fine temporal resolution

H Kawahara, M Morise, R Nisimura… - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
Another simple and high-speed F0 extractor with high temporal resolution based on our
previous proposal has been developed by adding a higher-order symmetry measure. This …

Studying the Effect of Frame-Level Concatenation of GFCC and TS-MFCC Features on Zero-Shot Children's ASR

Ankita, Shambhavi, S Shahnawazuddin - International Conference on …, 2023 - Springer
The work presented in this paper aims at enhancing the recognition performance of zero-
shot children's speech recognition task through frame-level concatenation of two …

Analysis and synthesis of strong vocal expressions: Extension and application of audio texture features to singing voice

H Kawahara, M Morise - 2012 IEEE International Conference …, 2012 - ieeexplore.ieee.org
Realistic reconstruction and manipulation of strong vocal expressions found in singing
voices is a challenging and exciting topic. A speech analysis, modification and resynthesis …

Evaluation and optimization of F0-adaptive spectral envelope estimation based on spectral smoothing with peak emphasis

H Akagiri, M Morise, T Irino, H Kawahara - Trans. IEICE, 2011 - acoustics.asn.au
A new spectral estimation method which improves processed sound quality of
STRAIGHT, a speech analysis, modification and re-synthesis framework widely used for …