Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Challenges and opportunities of biometric user authentication in the age of iot: A survey

CW Lien, S Vhaduri - ACM Computing Surveys, 2023 - dl.acm.org
While the Internet of Things (IoT) devices, such as smartwatches, provide a range of services
from managing financial transactions to monitoring smart homes, these devices often lead to …

Large-scale self-supervised speech representation learning for automatic speaker verification

Z Chen, S Chen, Y Wu, Y Qian, C Wang… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
The speech representations learned from large-scale unlabeled data have shown better
generalizability than those from supervised learning and thus attract a lot of interest to be …

A review on human-computer interaction and intelligent robots

F Ren, Y Bao - International Journal of Information Technology & …, 2020 - World Scientific
In the field of artificial intelligence, human–computer interaction (HCI) technology and its
related intelligent robot technologies are essential and interesting contents of research …

Spoken instruction understanding in air traffic control: Challenge, technique, and application

Y Lin - Aerospace, 2021 - mdpi.com
In air traffic control (ATC), speech communication with radio transmission is the primary way
to exchange information between the controller and aircrew. A wealth of contextual …

[PDF][PDF] Angular Softmax for Short-Duration Text-independent Speaker Verification.

Z Huang, S Wang, K Yu - Interspeech, 2018 - isca-archive.org
Recently, researchers propose to build deep learning based endto-end speaker verification
(SV) systems and achieve competitive results compared with the standard i-vector approach …

Audio-visual deep neural network for robust person verification

Y Qian, Z Chen, S Wang - IEEE/ACM Transactions on Audio …, 2021 - ieeexplore.ieee.org
Voice and face are two most popular biometrics for person verification, usually used in
speaker verification and face verification tasks. It has already been observed that simply …

Layer-wise fast adaptation for end-to-end multi-accent speech recognition

Y Qian, X Gong, H Huang - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
The variety and complexity of accents pose a huge challenge to robust Automatic Speech
Recognition (ASR). Some previous work has attempted to address such problems, however …

Deep learning methods in speaker recognition: a review

D Sztahó, G Szaszák, A Beke - arXiv preprint arXiv:1911.06615, 2019 - arxiv.org
This paper summarizes the applied deep learning practices in the field of speaker
recognition, both verification and identification. Speaker recognition has been a widely used …

Analysis of DNN approaches to speaker identification

P Matějka, O Glembek, O Novotný… - … on acoustics, speech …, 2016 - ieeexplore.ieee.org
This work studies the usage of the Deep Neural Network (DNN) Bottleneck (BN) features
together with the traditional MFCC features in the task of i-vector-based speaker recognition …