- 学术资源搜索

A comprehensive literature review on children's databases for machine learning applications

S Al-Azani, SM Sait, KA Al-Utaibi - IEEE Access, 2022 - ieeexplore.ieee.org

The COVID-19 pandemic can be attributed as a main factor to accelerate the current digital
transformation and to encourage innovation and technological adoption. Consequently, the …

被引用次数：9 相关文章所有 5 个版本

[PDF] uu.nl

Emotion, age, and gender classification in children's speech by humans and machines

H Kaya, AA Salah, A Karpov, O Frolova… - Computer Speech & …, 2017 - Elsevier

In this article, we present the first child emotional speech corpus in Russian, called
“EmoChildRu”, collected from 3 to 7 years old children. The base corpus includes over 20 K …

被引用次数：80 相关文章所有 9 个版本

[PDF] academia.edu

[PDF][PDF] Automatic identification of gender from speech

SI Levitan, T Mishra, S Bangalore - Proceeding of speech prosody, 2016 - academia.edu

Identifying the gender of a speaker from speech has a variety of applications ranging from
speech analytics to personalizing human-machine interactions. While gender identification …

被引用次数：79 相关文章所有 9 个版本

Age-vox-celeb: Multi-modal corpus for facial and speech estimation

N Tawara, A Ogawa, Y Kitagishi… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

Estimating a speaker's age from their speech is more challenging than age estimation from
their face because of insufficiently available public corpora. To tackle this problem, we …

被引用次数：27 相关文章

[PDF] iisc.ac.in

Speaker age estimation on conversational telephone speech using senone posterior based i-vectors

SO Sadjadi, S Ganapathy… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org

Automatic age estimation from speech has a variety of applications including natural human-
computer interaction, targeted advertising, customer-agent pairing in call centers, and …

被引用次数：58 相关文章所有 6 个版本

[PDF] odyssey2016.org

[PDF][PDF] Identification of British English regional accents using fusion of i-vector and multi-accent phonotactic systems.

M Najafian, S Safavi, P Weber, MJ Russell - Odyssey, 2016 - odyssey2016.org

The para-linguistic information in a speech signal includes clues to the geographical and
social background of the speaker. This paper is concerned with recognition of the 14 …

被引用次数：56 相关文章所有 6 个版本

[PDF] ieee.org

Augmentation techniques for adult-speech to generate child-like speech data samples at scale

MY Yiwere, A Barcovschi, R Jain, H Cucu… - IEEE …, 2023 - ieeexplore.ieee.org

Technologies such as Text-To-Speech (TTS) synthesis and Automatic Speech Recognition
(ASR) have become important in providing speech-based Artificial Intelligence (AI) solutions …

被引用次数：4 相关文章

Automated prediction of children's age from voice acoustics

M Novotny, R Cmejla, T Tykalova - Biomedical Signal Processing and …, 2023 - Elsevier

The emergence of a variety of applications aimed at video gaming, parental control,
education, specific language impairment, child development assessment, and speech …

被引用次数：4 相关文章

[PDF] uea.ac.uk

[PDF][PDF] Unsupervised model selection for recognition of regional accented speech

M Najafian, A DeMarco, S Cox… - … annual conference of the …, 2014 - cmp.uea.ac.uk

This paper is concerned with automatic speech recognition (ASR) for accented speech.
Given a small amount of speech from a new speaker, is it better to apply speaker adaptation …

被引用次数：40 相关文章所有 14 个版本

Automatic speaker and age identification of children from raw speech using sincNet over ERB scale

K Radha, M Bansal, RB Pachori - Speech Communication, 2024 - Elsevier

This paper presents the newly developed non-native children's English speech (NNCES)
corpus to reveal the findings of automatic speaker and age recognition from raw speech …

被引用次数：7 相关文章