Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Challenges and opportunities of biometric user authentication in the age of iot: A survey

CW Lien, S Vhaduri - ACM Computing Surveys, 2023 - dl.acm.org
While the Internet of Things (IoT) devices, such as smartwatches, provide a range of services
from managing financial transactions to monitoring smart homes, these devices often lead to …

Transfer learning for speech and language processing

D Wang, TF Zheng - 2015 Asia-Pacific Signal and Information …, 2015 - ieeexplore.ieee.org
Transfer learning is a vital technique that generalizes models trained for one setting or task
to other settings or tasks. For example in speech recognition, an acoustic model trained for …

Spoken instruction understanding in air traffic control: Challenge, technique, and application

Y Lin - Aerospace, 2021 - mdpi.com
In air traffic control (ATC), speech communication with radio transmission is the primary way
to exchange information between the controller and aircrew. A wealth of contextual …

Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, R Rana, S Khalifa, R Jurdak, J Qadir… - arXiv preprint arXiv …, 2020 - arxiv.org
Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …

End-to-end audiovisual speech recognition system with multitask learning

F Tao, C Busso - IEEE Transactions on Multimedia, 2020 - ieeexplore.ieee.org
An automatic speech recognition (ASR) system is a key component in current speech-based
systems. However, the surrounding acoustic noise can severely degrade the performance of …

[PDF][PDF] Angular Softmax for Short-Duration Text-independent Speaker Verification.

Z Huang, S Wang, K Yu - Interspeech, 2018 - isca-archive.org
Recently, researchers propose to build deep learning based endto-end speaker verification
(SV) systems and achieve competitive results compared with the standard i-vector approach …

Automatic speaker verification systems and spoof detection techniques: review and analysis

A Mittal, M Dua - International Journal of Speech Technology, 2022 - Springer
Automatic speaker verification (ASV) systems are enhanced enough, that industry is
attracted to use them practically in security systems. However, vulnerability of these systems …

Deep learning methods in speaker recognition: a review

D Sztahó, G Szaszák, A Beke - arXiv preprint arXiv:1911.06615, 2019 - arxiv.org
This paper summarizes the applied deep learning practices in the field of speaker
recognition, both verification and identification. Speaker recognition has been a widely used …

Past review, current progress, and challenges ahead on the cocktail party problem

Y Qian, C Weng, X Chang, S Wang, D Yu - Frontiers of Information …, 2018 - Springer
The cocktail party problem, ie, tracing and recognizing the speech of a specific speaker
when multiple speakers talk simultaneously, is one of the critical problems yet to be solved …