Unsupervised domain adaptation via domain adversarial training for speaker recognition

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier

The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

被引用次数：138 相关文章所有 6 个版本

[PDF] arxiv.org

Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

被引用次数：380 相关文章所有 9 个版本

[PDF] arxiv.org

A review of domain adaptation without target labels

WM Kouw, M Loog - IEEE transactions on pattern analysis and …, 2019 - ieeexplore.ieee.org

Domain adaptation has become a prominent problem setting in machine learning and
related fields. This review asks the question: How can a classifier learn from a source …

被引用次数：565 相关文章所有 12 个版本

[PDF] ieee.org

A survey of speaker recognition: Fundamental theories, recognition methods and opportunities

MM Kabir, MF Mridha, J Shin, I Jahan, AQ Ohi - IEEE Access, 2021 - ieeexplore.ieee.org

Humans can identify a speaker by listening to their voice, over the telephone, or on any
digital devices. Acquiring this congenital human competency, authentication technologies …

被引用次数：103 相关文章所有 4 个版本

[PDF] ieee.org

Domain adversarial for acoustic emotion recognition

M Abdelwahab, C Busso - IEEE/ACM Transactions on Audio …, 2018 - ieeexplore.ieee.org

The performance of speech emotion recognition is affected by the differences in data
distributions between train (source domain) and test (target domain) sets used to build and …

被引用次数：187 相关文章所有 8 个版本

[PDF] ieee.org

Domain adaptation: challenges, methods, datasets, and applications

P Singhal, R Walambe, S Ramanna, K Kotecha - IEEE access, 2023 - ieeexplore.ieee.org

Deep Neural Networks (DNNs) trained on one dataset (source domain) do not perform well
on another set of data (target domain), which is different but has similar properties as the …

被引用次数：43 相关文章所有 2 个版本

[PDF] arxiv.org

Augmentation adversarial training for self-supervised speaker recognition

J Huh, HS Heo, J Kang, S Watanabe… - arXiv preprint arXiv …, 2020 - arxiv.org

The goal of this work is to train robust speaker recognition models without speaker labels.
Recent works on unsupervised speaker representations are based on contrastive learning …

被引用次数：83 相关文章所有 3 个版本

[PDF] arxiv.org

Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, R Rana, S Khalifa, R Jurdak, J Qadir… - arXiv preprint arXiv …, 2020 - arxiv.org

Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …

被引用次数：104 相关文章所有 3 个版本

[PDF] sciencedirect.com

Machine learning for stuttering identification: Review, challenges and future directions

SA Sheikh, M Sahidullah, F Hirsch, S Ouni - Neurocomputing, 2022 - Elsevier

Stuttering is a speech disorder during which the flow of speech is interrupted by involuntary
pauses and repetition of sounds. Stuttering identification is an interesting interdisciplinary …

被引用次数：53 相关文章所有 11 个版本

High-accuracy prediction and compensation of industrial robot stiffness deformation

C Ye, J Yang, H Ding - International Journal of Mechanical Sciences, 2022 - Elsevier

Industrial robots (IRs) are promising options for machining large complex structural parts
due to the higher flexibility, larger operating space, and lower cost compared with multi-axis …

被引用次数：36 相关文章