Multi-task learning for text-dependent speaker verification

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Cited by 419 · Related articles · All 9 versions

[PDF] acm.org

Challenges and opportunities of biometric user authentication in the age of IoT: A survey

CW Lien, S Vhaduri - ACM Computing Surveys, 2023 - dl.acm.org

While the Internet of Things (IoT) devices, such as smartwatches, provide a range of services
from managing financial transactions to monitoring smart homes, these devices often lead to …

Cited by 44 · Related articles

[PDF] arxiv.org

Transfer learning for speech and language processing

D Wang, TF Zheng - 2015 Asia-Pacific Signal and Information …, 2015 - ieeexplore.ieee.org

Transfer learning is a vital technique that generalizes models trained for one setting or task
to other settings or tasks. For example in speech recognition, an acoustic model trained for …

Cited by 262 · Related articles · All 12 versions

[PDF] mdpi.com

Spoken instruction understanding in air traffic control: Challenge, technique, and application

Y Lin - Aerospace, 2021 - mdpi.com

In air traffic control (ATC), speech communication with radio transmission is the primary way
to exchange information between the controller and aircrew. A wealth of contextual …

Cited by 72 · Related articles · All 10 versions

[PDF] arxiv.org

Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, R Rana, S Khalifa, R Jurdak, J Qadir… - arXiv preprint arXiv …, 2020 - arxiv.org

Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …

Cited by 112 · Related articles · All 3 versions

[PDF] ieee.org

End-to-end audiovisual speech recognition system with multitask learning

F Tao, C Busso - IEEE Transactions on Multimedia, 2020 - ieeexplore.ieee.org

An automatic speech recognition (ASR) system is a key component in current speech-based
systems. However, the surrounding acoustic noise can severely degrade the performance of …

Cited by 92 · Related articles · All 4 versions

[PDF] isca-archive.org

[PDF] Angular Softmax for Short-Duration Text-independent Speaker Verification.

Z Huang, S Wang, K Yu - Interspeech, 2018 - isca-archive.org

Recently, researchers propose to build deep learning based end-to-end speaker verification
(SV) systems and achieve competitive results compared with the standard i-vector approach …

Cited by 119 · Related articles · All 5 versions

[PDF] researchgate.net

Automatic speaker verification systems and spoof detection techniques: review and analysis

A Mittal, M Dua - International Journal of Speech Technology, 2022 - Springer

Automatic speaker verification (ASV) systems are enhanced enough, that industry is
attracted to use them practically in security systems. However, vulnerability of these systems …

Cited by 55 · Related articles · All 4 versions

[PDF] arxiv.org

Deep learning methods in speaker recognition: a review

D Sztahó, G Szaszák, A Beke - arXiv preprint arXiv:1911.06615, 2019 - arxiv.org

This paper summarizes the applied deep learning practices in the field of speaker
recognition, both verification and identification. Speaker recognition has been a widely used …

Cited by 80 · Related articles · All 8 versions

[HTML] zju.edu.cn

Past review, current progress, and challenges ahead on the cocktail party problem

Y Qian, C Weng, X Chang, S Wang, D Yu - Frontiers of Information …, 2018 - Springer

The cocktail party problem, i.e., tracing and recognizing the speech of a specific speaker
when multiple speakers talk simultaneously, is one of the critical problems yet to be solved …

Cited by 102 · Related articles · All 6 versions