Deep transfer learning for automatic speech recognition: Towards better generalization

H Kheddar, Y Himeur, S Al-Maadeed, A Amira… - Knowledge-Based …, 2023 - Elsevier
Automatic speech recognition (ASR) has recently become an important challenge when
using deep learning (DL). It requires large-scale training datasets and high computational …

Adaptation of Whisper models to child speech recognition

R Jain, A Barcovschi, M Yiwere, P Corcoran… - arXiv preprint arXiv …, 2023 - arxiv.org
Automatic Speech Recognition (ASR) systems often struggle with transcribing child speech
due to the lack of large child speech datasets required to accurately train child-friendly ASR …

A wav2vec2-based experimental study on self-supervised learning methods to improve child speech recognition

R Jain, A Barcovschi, MY Yiwere, D Bigioi… - IEEE …, 2023 - ieeexplore.ieee.org
Despite recent advancements in deep learning technologies, Child Speech Recognition
remains a challenging task. Current Automatic Speech Recognition (ASR) models require …

A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition

A Barcovschi, R Jain, P Corcoran - … International Conference on …, 2023 - ieeexplore.ieee.org
Automatic Speech Recognition (ASR) systems have progressed significantly in their
performance on adult speech data; however, transcribing child speech remains challenging …

Exploring Native and Non-Native English Child Speech Recognition With Whisper

R Jain, A Barcovschi, MY Yiwere, P Corcoran… - IEEE …, 2024 - ieeexplore.ieee.org
Modern end-to-end Automatic Speech Recognition (ASR) systems struggle to recognise
children's speech. This challenge is due to the high acoustic variability in children's voices …

Comparison of modern and traditional Slovak children's speech recognition

A Buday, J Juhár, A Čižmár… - 2023 World Symposium …, 2023 - ieeexplore.ieee.org
We compare two distinct speech recognition approaches, namely Hidden Markov models
mixed with deep neural networks and modern end-to-end neural speech recognition …

[PDF][PDF] Introduction To Partial Fine-tuning: A Comprehensive Evaluation Of End-to-end Children's Automatic Speech Recognition Adaptation

T Rolland, A Abad - Procs. of Interspeech, Kos Island, Greece, 2024 - isca-archive.org
Abstract Automatic Speech Recognition (ASR) encounters unique challenges when dealing
with children's speech, mainly due to the scarcity of available data. Training large ASR …

[PDF][PDF] Towards improved Automatic Speech Recognition for children

T Rolland, A Abad - Transfer, 2024 - isca-archive.org
Abstract Children's Automatic Speech Recognition (ASR) represents a considerable
challenge, with a considerable performance decline of state-of-the-art systems when …

Exploring Adapters with Conformers for Children's Automatic Speech Recognition

T Rolland, A Abad - ICASSP 2024-2024 IEEE International …, 2024 - ieeexplore.ieee.org
The high variability in acoustic, pronunciation, and linguistic characteristics of children's
speech makes of children's automatic speech recognition (ASR) a complex task. Training a …

FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data

D Liu, J Xiong - arXiv preprint arXiv:2406.17926, 2024 - arxiv.org
Automatic Speech Recognition (ASR) for adults' speeches has made significant progress by
employing deep neural network (DNN) models recently, but improvement in children's …