A survey of speaker recognition: Fundamental theories, recognition methods and opportunities

MM Kabir, MF Mridha, J Shin, I Jahan, AQ Ohi - IEEE Access, 2021 - ieeexplore.ieee.org
Humans can identify a speaker by listening to their voice, over the telephone, or on any
digital devices. Acquiring this congenital human competency, authentication technologies …

Deep learning in diverse intelligent sensor based systems

Y Zhu, M Wang, X Yin, J Zhang, E Meijering, J Hu - Sensors, 2022 - mdpi.com
Deep learning has become a predominant method for solving data analysis problems in
virtually all fields of science and engineering. The increasing complexity and the large …

[PDF][PDF] End-to-end text-independent speaker verification with triplet loss on short utterances.

C Zhang, K Koishida - Interspeech, 2017 - isca-archive.org
Text-independent speaker verification against short utterances is still challenging despite of
recent advances in the field of speaker recognition with i-vector framework. In general, to get …

Large margin softmax loss for speaker verification

Y Liu, L He, J Liu - arXiv preprint arXiv:1904.03479, 2019 - arxiv.org
In neural network based speaker verification, speaker embedding is expected to be
discriminative between speakers while the intra-speaker distance should remain small. A …

Text-independent speaker verification based on triplet convolutional neural network embeddings

C Zhang, K Koishida… - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org
The effectiveness of introducing deep neural networks into conventional speaker recognition
pipelines has been broadly shown to benefit system performance. A novel text-independent …

Self-supervised text-independent speaker verification using prototypical momentum contrastive learning

W Xia, C Zhang, C Weng, M Yu… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
In this study, we investigate self-supervised representation learning for speaker verification
(SV). First, we examine a simple contrastive learning approach (SimCLR) with a momentum …

Arabic sign language recognition system using 2D hands and body skeleton data

MA Bencherif, M Algabri, MA Mekhtiche, M Faisal… - IEEE …, 2021 - ieeexplore.ieee.org
This paper presents a novel Arabic Sign Language (ArSL) recognition system, using
selected 2D hands and body key points from successive video frames. The system …

C3-DINO: Joint contrastive and non-contrastive self-supervised learning for speaker verification

C Zhang, D Yu - IEEE Journal of Selected Topics in Signal …, 2022 - ieeexplore.ieee.org
Self-supervised learning (SSL) has drawn an increased attention in the field of speech
processing. Recent studies have demonstrated that contrastive learning is able to learn …

Deep speaker recognition: Process, progress, and challenges

AQ Ohi, MF Mridha, MA Hamid, MM Monowar - IEEE Access, 2021 - ieeexplore.ieee.org
Speaker recognition is related to human biometrics dealing with the identification of
speakers from their speech. Speaker recognition is an active research area and being …

A survey on artificial intelligence-based acoustic source identification

R Zaheer, I Ahmad, D Habibi, KY Islam… - IEEE Access, 2023 - ieeexplore.ieee.org
The concept of Acoustic Source Identification (ASI), which refers to the process of identifying
noise sources has attracted increasing attention in recent years. The ASI technology can be …