Data augmentation and deep learning methods in sound classification: A systematic review

OO Abayomi-Alli, R Damaševičius, A Qazi… - Electronics, 2022 - mdpi.com
The aim of this systematic literature review (SLR) is to identify and critically evaluate current
research advancements with respect to small data and the use of data augmentation …

Automatic speech recognition (asr) systems for children: A systematic literature review

V Bhardwaj, MT Ben Othman, V Kukreja, Y Belkhier… - Applied Sciences, 2022 - mdpi.com
Automatic speech recognition (ASR) is one of the ways used to transform acoustic speech
signals into text. Over the last few decades, an enormous amount of research work has been …

A wav2vec2-based experimental study on self-supervised learning methods to improve child speech recognition

R Jain, A Barcovschi, MY Yiwere, D Bigioi… - IEEE …, 2023 - ieeexplore.ieee.org
Despite recent advancements in deep learning technologies, Child Speech Recognition
remains a challenging task. Current Automatic Speech Recognition (ASR) models require …

[HTML][HTML] A formant modification method for improved ASR of children's speech

HK Kathania, SR Kadiri, P Alku, M Kurimo - Speech Communication, 2022 - Elsevier
Differences in acoustic characteristics between children's and adults' speech degrade
performance of automatic speech recognition systems when systems trained using adults' …

A text-to-speech pipeline, evaluation methodology, and initial fine-tuning results for child speech synthesis

R Jain, MY Yiwere, D Bigioi, P Corcoran… - IEEE Access, 2022 - ieeexplore.ieee.org
Speech synthesis has come a long way as current text-to-speech (TTS) models can now
generate natural human-sounding speech. However, most of the TTS research focuses on …

In domain training data augmentation on noise robust Punjabi Children speech recognition

V Kadyan, P Bawa, T Hasija - Journal of Ambient Intelligence and …, 2022 - Springer
For building a successful automatic speech recognition (ASR) engine large training data is
required. It increases training complexity and become impossible for less resource language …

A review on lung disease recognition by acoustic signal analysis with deep learning networks

AH Sfayyih, N Sulaiman, AH Sabry - Journal of big Data, 2023 - Springer
Recently, assistive explanations for difficulties in the health check area have been made
viable thanks in considerable portion to technologies like deep learning and machine …

Lung disease recognition methods using audio-based analysis with machine learning

AH Sabry, OID Bashi, NHN Ali, YM Al Kubaisi - Heliyon, 2024 - cell.com
The use of computer-based automated approaches and improvements in lung sound
recording techniques have made lung sound-based diagnostics even better and devoid of …

[HTML][HTML] Crossing language identification: Multilingual ASR framework based on semantic dataset creation & Wav2Vec 2.0

OH Anidjar, R Yozevitch, N Bigon, N Abdalla… - Machine Learning with …, 2023 - Elsevier
This study proposes an innovative methodology to enhance the performance of multilingual
Automatic Speech Recognition (ASR) systems by capitalizing on the high semantic similarity …

Prosodic feature-based discriminatively trained low resource speech recognition system

T Hasija, V Kadyan, K Guleria, A Alharbi, H Alyami… - Sustainability, 2022 - mdpi.com
Speech recognition has been an active field of research in the last few decades since it
facilitates better human–computer interaction. Native language automatic speech …