A review of deep learning techniques for speech processing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

A review of domain adaptation without target labels

WM Kouw, M Loog - IEEE transactions on pattern analysis and …, 2019 - ieeexplore.ieee.org
Domain adaptation has become a prominent problem setting in machine learning and
related fields. This review asks the question: How can a classifier learn from a source …

A survey of speaker recognition: Fundamental theories, recognition methods and opportunities

MM Kabir, MF Mridha, J Shin, I Jahan, AQ Ohi - IEEE Access, 2021 - ieeexplore.ieee.org
Humans can identify a speaker by listening to their voice, over the telephone, or on any
digital devices. Acquiring this congenital human competency, authentication technologies …

Domain adversarial for acoustic emotion recognition

M Abdelwahab, C Busso - IEEE/ACM Transactions on Audio …, 2018 - ieeexplore.ieee.org
The performance of speech emotion recognition is affected by the differences in data
distributions between train (source domain) and test (target domain) sets used to build and …

Domain adaptation: challenges, methods, datasets, and applications

P Singhal, R Walambe, S Ramanna, K Kotecha - IEEE access, 2023 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) trained on one dataset (source domain) do not perform well
on another set of data (target domain), which is different but has similar properties as the …

Augmentation adversarial training for self-supervised speaker recognition

J Huh, HS Heo, J Kang, S Watanabe… - arXiv preprint arXiv …, 2020 - arxiv.org
The goal of this work is to train robust speaker recognition models without speaker labels.
Recent works on unsupervised speaker representations are based on contrastive learning …

Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, R Rana, S Khalifa, R Jurdak, J Qadir… - arXiv preprint arXiv …, 2020 - arxiv.org
Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …

Machine learning for stuttering identification: Review, challenges and future directions

SA Sheikh, M Sahidullah, F Hirsch, S Ouni - Neurocomputing, 2022 - Elsevier
Stuttering is a speech disorder during which the flow of speech is interrupted by involuntary
pauses and repetition of sounds. Stuttering identification is an interesting interdisciplinary …

High-accuracy prediction and compensation of industrial robot stiffness deformation

C Ye, J Yang, H Ding - International Journal of Mechanical Sciences, 2022 - Elsevier
Industrial robots (IRs) are promising options for machining large complex structural parts
due to the higher flexibility, larger operating space, and lower cost compared with multi-axis …