World: a vocoder-based high-quality speech synthesis system for real-time applications

M Morise, F Yokomori, K Ozawa - IEICE TRANSACTIONS on …, 2016 - search.ieice.org
A vocoder-based speech synthesis system, named WORLD, was developed in an effort to
improve the sound quality of real-time applications using speech. Speech analysis …

Quantifying sources of variability in infancy research using the infant-directed-speech preference

ManyBabies Consortium - Advances in Methods and …, 2020 - journals.sagepub.com
Psychological scientists have become increasingly concerned with issues related to
methodology and replicability, and infancy researchers in particular face specific challenges …

[HTML][HTML] Central gain restores auditory processing following near-complete cochlear denervation

AR Chambers, J Resnik, Y Yuan, JP Whitton, AS Edge… - Neuron, 2016 - cell.com
Sensory organ damage induces a host of cellular and physiological changes in the
periphery and the brain. Here, we show that some aspects of auditory processing recover …

[HTML][HTML] D4C, a band-aperiodicity estimator for high-quality speech synthesis

M Morise - Speech Communication, 2016 - Elsevier
An algorithm is proposed for estimating the band aperiodicity of speech signals, where
“aperiodicity” is defined as the power ratio between the speech signal and the aperiodic …

Cognitive compensation of speech perception with hearing impairment, cochlear implants, and aging: How and to what degree can it be achieved?

D Başkent, J Clarke, C Pals, MR Benard… - Trends in …, 2016 - journals.sagepub.com
External degradations in incoming speech reduce understanding, and hearing impairment
further compounds the problem. While cognitive mechanisms alleviate some of the …

Influences of fundamental frequency, formant frequencies, aperiodicity, and spectrum level on the perception of voice gender

VG Skuk, SR Schweinberger - 2014 - ASHA
Purpose To determine the relative importance of acoustic parameters (fundamental
frequency [F0], formant frequencies [FFs], aperiodicity, and spectrum level [SL]) on voice …

[HTML][HTML] CheapTrick, a spectral envelope estimator for high-quality speech synthesis

M Morise - Speech Communication, 2015 - Elsevier
A spectral envelope estimation algorithm is presented to achieve high-quality speech
synthesis. The concept of the algorithm is to obtain an accurate and temporally stable …

[PDF][PDF] Harvest: A High-Performance Fundamental Frequency Estimator from Speech Signals.

M Morise - INTERSPEECH, 2017 - isca-archive.org
A fundamental frequency (F0) estimator named Harvest is described. The unique points of
Harvest are that it can obtain a reliable F0 contour and reduce the error that the voiced …

In spoken word recognition, the future predicts the past

L Gwilliams, T Linzen, D Poeppel… - Journal of …, 2018 - Soc Neuroscience
Speech is an inherently noisy and ambiguous signal. To fluently derive meaning, a listener
must integrate contextual information to guide interpretations of the sensory input. Although …

[HTML][HTML] Inharmonic speech reveals the role of harmonicity in the cocktail party problem

S Popham, D Boebinger, DPW Ellis… - Nature …, 2018 - nature.com
The “cocktail party problem” requires us to discern individual sound sources from mixtures of
sources. The brain must use knowledge of natural sound regularities for this purpose. One …