Automatic speaker profiling from short duration speech data
Many paralinguistic applications of speech demand the extraction of information about the
speaker characteristics from as little speech data as possible. In this work, we explore the
estimation of multiple physical parameters of the speaker from the short duration of speech
in a multilingual setting. We explore different feature streams for age and body build
estimation derived from the speech spectrum at different resolutions, namely–short-term log-
mel spectrogram, formant features and harmonic features of the speech. The statistics of …
speaker characteristics from as little speech data as possible. In this work, we explore the
estimation of multiple physical parameters of the speaker from the short duration of speech
in a multilingual setting. We explore different feature streams for age and body build
estimation derived from the speech spectrum at different resolutions, namely–short-term log-
mel spectrogram, formant features and harmonic features of the speech. The statistics of …
以上显示的是最相近的搜索结果。 查看全部搜索结果