An investigation of deep-learning frameworks for speaker verification antispoofing

M Sahidullah, H Delgado, M Todisco, A Nautsch… - Handbook of Biometric …, 2023 - Springer

Over the past few years, significant progress has been made in the field of presentation
attack detection (PAD) for automatic speaker recognition (ASV). This includes the …

被引用次数：91 相关文章所有 18 个版本

[PDF] ieee.org

One-class learning towards synthetic voice spoofing detection

Y Zhang, F Jiang, Z Duan - IEEE Signal Processing Letters, 2021 - ieeexplore.ieee.org

Human voices can be used to authenticate the identity of the speaker, but the automatic
speaker verification (ASV) systems are vulnerable to voice spoofing attacks, such as …

被引用次数：230 相关文章所有 8 个版本

[PDF] arxiv.org

A comparative study on recent neural spoofing countermeasures for synthetic speech detection

X Wang, J Yamagishi - arXiv preprint arXiv:2103.11326, 2021 - arxiv.org

A great deal of recent research effort on speech spoofing countermeasures has been
invested into back-end neural networks and training criteria. We contribute to this effort with …

被引用次数：178 相关文章所有 7 个版本

[PDF] arxiv.org

Towards end-to-end synthetic speech detection

G Hua, ABJ Teoh, H Zhang - IEEE Signal Processing Letters, 2021 - ieeexplore.ieee.org

The constant Q transform (CQT) has been shown to be one of the most effective speech
signal pre-transforms to facilitate synthetic speech detection, followed by either hand-crafted …

被引用次数：134 相关文章所有 4 个版本

[PDF] isca-archive.org

[PDF][PDF] End-to-end text-independent speaker verification with triplet loss on short utterances.

C Zhang, K Koishida - Interspeech, 2017 - isca-archive.org

Text-independent speaker verification against short utterances is still challenging despite of
recent advances in the field of speaker recognition with i-vector framework. In general, to get …

被引用次数：276 相关文章所有 5 个版本

[PDF] researchgate.net

Text-independent speaker verification based on triplet convolutional neural network embeddings

C Zhang, K Koishida… - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org

The effectiveness of introducing deep neural networks into conventional speaker recognition
pipelines has been broadly shown to benefit system performance. A novel text-independent …

被引用次数：184 相关文章所有 5 个版本

[PDF] arxiv.org

Evaluation of an audio-video multimodal deepfake dataset using unimodal and multimodal detectors

H Khalid, M Kim, S Tariq, SS Woo - Proceedings of the 1st workshop on …, 2021 - dl.acm.org

Significant advancements made in the generation of deepfakes have caused security and
privacy issues. Attackers can easily impersonate a person's identity in an image by replacing …

被引用次数：71 相关文章所有 4 个版本

[PDF] thecvf.com

Multimodaltrace: Deepfake detection using audiovisual representation learning

MA Raza, KM Malik - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com

By employing generative deep learning techniques, Deepfakes are created with the intent to
create mistrust in society, manipulate public opinion and political decisions, and for other …

被引用次数：25 相关文章所有 4 个版本

[PDF] cambridge.org

Advances in anti-spoofing: from the perspective of ASVspoof challenges

MR Kamble, HB Sailor, HA Patil, H Li - APSIPA Transactions on …, 2020 - cambridge.org

In recent years, automatic speaker verification (ASV) is used extensively for voice biometrics.
This leads to an increased interest to secure these voice biometric systems for real-world …

被引用次数：113 相关文章所有 4 个版本

[PDF] springer.com

Synthetic speech detection through short-term and long-term prediction traces

C Borrelli, P Bestagini, F Antonacci, A Sarti… - EURASIP Journal on …, 2021 - Springer

Several methods for synthetic audio speech generation have been developed in the
literature through the years. With the great technological advances brought by deep …

被引用次数：67 相关文章所有 9 个版本