A comprehensive literature review on children's databases for machine learning applications
The COVID-19 pandemic can be attributed as a main factor to accelerate the current digital
transformation and to encourage innovation and technological adoption. Consequently, the …
transformation and to encourage innovation and technological adoption. Consequently, the …
Emotion, age, and gender classification in children's speech by humans and machines
In this article, we present the first child emotional speech corpus in Russian, called
“EmoChildRu”, collected from 3 to 7 years old children. The base corpus includes over 20 K …
“EmoChildRu”, collected from 3 to 7 years old children. The base corpus includes over 20 K …
[PDF][PDF] Automatic identification of gender from speech
SI Levitan, T Mishra, S Bangalore - Proceeding of speech prosody, 2016 - academia.edu
Identifying the gender of a speaker from speech has a variety of applications ranging from
speech analytics to personalizing human-machine interactions. While gender identification …
speech analytics to personalizing human-machine interactions. While gender identification …
Age-vox-celeb: Multi-modal corpus for facial and speech estimation
Estimating a speaker's age from their speech is more challenging than age estimation from
their face because of insufficiently available public corpora. To tackle this problem, we …
their face because of insufficiently available public corpora. To tackle this problem, we …
Speaker age estimation on conversational telephone speech using senone posterior based i-vectors
SO Sadjadi, S Ganapathy… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
Automatic age estimation from speech has a variety of applications including natural human-
computer interaction, targeted advertising, customer-agent pairing in call centers, and …
computer interaction, targeted advertising, customer-agent pairing in call centers, and …
[PDF][PDF] Identification of British English regional accents using fusion of i-vector and multi-accent phonotactic systems.
The para-linguistic information in a speech signal includes clues to the geographical and
social background of the speaker. This paper is concerned with recognition of the 14 …
social background of the speaker. This paper is concerned with recognition of the 14 …
Augmentation techniques for adult-speech to generate child-like speech data samples at scale
Technologies such as Text-To-Speech (TTS) synthesis and Automatic Speech Recognition
(ASR) have become important in providing speech-based Artificial Intelligence (AI) solutions …
(ASR) have become important in providing speech-based Artificial Intelligence (AI) solutions …
Automated prediction of children's age from voice acoustics
The emergence of a variety of applications aimed at video gaming, parental control,
education, specific language impairment, child development assessment, and speech …
education, specific language impairment, child development assessment, and speech …
[PDF][PDF] Unsupervised model selection for recognition of regional accented speech
This paper is concerned with automatic speech recognition (ASR) for accented speech.
Given a small amount of speech from a new speaker, is it better to apply speaker adaptation …
Given a small amount of speech from a new speaker, is it better to apply speaker adaptation …
Automatic speaker and age identification of children from raw speech using sincNet over ERB scale
This paper presents the newly developed non-native children's English speech (NNCES)
corpus to reveal the findings of automatic speaker and age recognition from raw speech …
corpus to reveal the findings of automatic speaker and age recognition from raw speech …