Speaker perception

SR Schweinberger, H Kawahara… - Wiley …, 2014 - Wiley Online Library
While humans use their voice mainly for communicating information about the world,
paralinguistic cues in the voice signal convey rich dynamic information about a speaker's …

D4C, a band-aperiodicity estimator for high-quality speech synthesis

M Morise - Speech Communication, 2016 - Elsevier
An algorithm is proposed for estimating the band aperiodicity of speech signals, where
“aperiodicity” is defined as the power ratio between the speech signal and the aperiodic …

Influences of fundamental frequency, formant frequencies, aperiodicity, and spectrum level on the perception of voice gender

VG Skuk, SR Schweinberger - 2014 - ASHA
Purpose To determine the relative importance of acoustic parameters (fundamental
frequency [F0], formant frequencies [FFs], aperiodicity, and spectrum level [SL]) on voice …

CheapTrick, a spectral envelope estimator for high-quality speech synthesis

M Morise - Speech Communication, 2015 - Elsevier
A spectral envelope estimation algorithm is presented to achieve high-quality speech
synthesis. The concept of the algorithm is to obtain an accurate and temporally stable …

Focal versus distributed temporal cortex activity for speech sound category assignment

S Bouton, V Chambon, R Tyrand… - Proceedings of the …, 2018 - National Acad Sciences
Percepts and words can be decoded from distributed neural activity measures. However, the
existence of widespread representations might conflict with the more classical notions of …

The impact of alphabetic literacy on the perception of speech sounds

R Kolinsky, AL Navas, FV de Paula, NR de Brito… - Cognition, 2021 - Elsevier
The aim of the present study was to evaluate the impact of literacy on phoneme perception. It
built on previous research by using more controlled stimuli than in former studies and by …

Expression control in singing voice synthesis: Features, approaches, evaluation, and challenges

M Umbert, J Bonada, M Goto, T Nakano… - IEEE Signal …, 2015 - ieeexplore.ieee.org
In the context of singing voice synthesis, expression control manipulates a set of voice
features related to a particular emotion, style, or singer. Also known as performance …

Development of exploratory research tools based on TANDEM-STRAIGHT

H Kawahara, T Takahashi… - … : APSIPA ASC 2009 …, 2009 - eprints.lib.hokudai.ac.jp
This article introduces a new set of tools based on TANDEM-STRAIGHT, a fundamental
reformulation of STRAIGHT, a speech analysis, modification and resynthesis system …

Temporally variable multi-aspect N-way morphing based on interference-free speech representations

H Kawahara, M Morise, H Banno… - 2013 Asia-Pacific Signal …, 2013 - ieeexplore.ieee.org
Voice morphing is a powerful tool for exploratory research and various applications. A
temporally variable multi-aspect morphing is extended to enable morphing of arbitrarily …

Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system

H Doi, T Toda, T Nakano, M Goto… - Proceedings of The …, 2012 - ieeexplore.ieee.org
The voice quality (identity) of singing voices is usually fixed in each singer. To overcome this
limitation and enable singers to freely change their voice quality using signal-processing …