A comprehensive literature review on children's databases for machine learning applications

S Al-Azani, SM Sait, KA Al-Utaibi - IEEE Access, 2022 - ieeexplore.ieee.org
The COVID-19 pandemic can be attributed as a main factor to accelerate the current digital
transformation and to encourage innovation and technological adoption. Consequently, the …

Emotion, age, and gender classification in children's speech by humans and machines

H Kaya, AA Salah, A Karpov, O Frolova… - Computer Speech & …, 2017 - Elsevier
In this article, we present the first child emotional speech corpus in Russian, called
“EmoChildRu”, collected from 3 to 7 years old children. The base corpus includes over 20 K …

[PDF][PDF] Automatic identification of gender from speech

SI Levitan, T Mishra, S Bangalore - Proceeding of speech prosody, 2016 - academia.edu
Identifying the gender of a speaker from speech has a variety of applications ranging from
speech analytics to personalizing human-machine interactions. While gender identification …

Age-vox-celeb: Multi-modal corpus for facial and speech estimation

N Tawara, A Ogawa, Y Kitagishi… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Estimating a speaker's age from their speech is more challenging than age estimation from
their face because of insufficiently available public corpora. To tackle this problem, we …

Speaker age estimation on conversational telephone speech using senone posterior based i-vectors

SO Sadjadi, S Ganapathy… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
Automatic age estimation from speech has a variety of applications including natural human-
computer interaction, targeted advertising, customer-agent pairing in call centers, and …

[PDF][PDF] Identification of British English regional accents using fusion of i-vector and multi-accent phonotactic systems.

M Najafian, S Safavi, P Weber, MJ Russell - Odyssey, 2016 - odyssey2016.org
The para-linguistic information in a speech signal includes clues to the geographical and
social background of the speaker. This paper is concerned with recognition of the 14 …

Augmentation techniques for adult-speech to generate child-like speech data samples at scale

MY Yiwere, A Barcovschi, R Jain, H Cucu… - IEEE …, 2023 - ieeexplore.ieee.org
Technologies such as Text-To-Speech (TTS) synthesis and Automatic Speech Recognition
(ASR) have become important in providing speech-based Artificial Intelligence (AI) solutions …

Automated prediction of children's age from voice acoustics

M Novotny, R Cmejla, T Tykalova - Biomedical Signal Processing and …, 2023 - Elsevier
The emergence of a variety of applications aimed at video gaming, parental control,
education, specific language impairment, child development assessment, and speech …

[PDF][PDF] Unsupervised model selection for recognition of regional accented speech

M Najafian, A DeMarco, S Cox… - … annual conference of the …, 2014 - cmp.uea.ac.uk
This paper is concerned with automatic speech recognition (ASR) for accented speech.
Given a small amount of speech from a new speaker, is it better to apply speaker adaptation …

Automatic speaker and age identification of children from raw speech using sincNet over ERB scale

K Radha, M Bansal, RB Pachori - Speech Communication, 2024 - Elsevier
This paper presents the newly developed non-native children's English speech (NNCES)
corpus to reveal the findings of automatic speaker and age recognition from raw speech …