Glottal source contribution to higher order modes in the finite element synthesis of vowels

M Freixes, M Arnela, JC Socoró, F Alías, O Guasch - Applied Sciences, 2019 - mdpi.com
Articulatory speech synthesis has long been based on one-dimensional (1D) approaches.
They assume plane wave propagation within the vocal tract and disregard higher order …

Cascaded all-pass filters with randomized center frequencies and phase polarity for acoustic and speech measurement and data augmentation

H Kawahara, K Yatabe - ICASSP 2021-2021 IEEE International …, 2021 - ieeexplore.ieee.org
We introduce a new member of TSP (Time Stretched Pulse) for acoustic and speech
measurement infrastructure, based on a simple all-pass filter and systematic randomization …

Window functions with minimum-sidelobe derivatives for computing instantaneous frequency

T Kusano, K Yatabe, Y Oikawa - IEEE Access, 2022 - ieeexplore.ieee.org
In the field of nonstationary signal analysis and processing, the short-time Fourier transform
(STFT) is frequently used to convert signals into the time-frequency domain. The …

Simultaneous measurement of time-invariant linear and nonlinear, and random and extra responses using frequency domain variant of velvet noise

H Kawahara, KI Sakakibara… - 2020 Asia-Pacific …, 2020 - ieeexplore.ieee.org
We introduce a new acoustic measurement method that can measure the linear time-
invariant response, the nonlinear time-invariant response, and random and time-varying …

Simultaneous measurement of multiple acoustic attributes using structured periodic test signals including music and other sound materials

H Kawahara, K Yatabe, KI Sakakibara… - 2023 Asia Pacific …, 2023 - ieeexplore.ieee.org
We introduce a general framework for measuring acoustic properties such as liner time-
invariant (LTI) response, signal-dependent time-invariant (SDTI) component, and random …

Maximally energy-concentrated differential window for phase-aware signal processing using instantaneous frequency

T Kusano, K Yatabe, Y Oikawa - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
The short-time Fourier transform (STFT) is widely employed in non-stationary signal
analysis, whose property depends on window functions. Instantaneous frequency in STFT …

Mixture of orthogonal sequences made from extended time-stretched pulses enables measurement of involuntary voice fundamental frequency response to pitch …

H Kawahara, T Matsui, K Yatabe, KI Sakakibara… - arXiv preprint arXiv …, 2021 - arxiv.org
Auditory feedback plays an essential role in the regulation of the fundamental frequency of
voiced sounds. The fundamental frequency also responds to auditory stimulation other than …

Designing nearly tight window for improving time-frequency masking

T Kusano, Y Masuyama, K Yatabe… - arXiv preprint arXiv …, 2018 - arxiv.org
Many audio signal processing methods are formulated in the time-frequency (TF) domain
which is obtained by the short-time Fourier transform (STFT). The properties of the STFT are …

[PDF][PDF] Contribution of the glottal flow residual in affect-related voice transformation.

Z Wang, C Gobl - INTERSPEECH, 2022 - tara.tcd.ie
This paper explores the contribution of the glottal flow residual in affect-related voice
transformation. This signal, which is defined as the difference between the output of the …

EGAN: A Neural Excitation Generation Model Based on Generative Adversarial Networks with Harmonics and Noise Input

YT Lin, CY Chiang - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
This paper presents a speech synthesis method based on source-filter modeling. The
source model is powered by a neural excitation generator trained by GAN to produce an …