Introduction to voice presentation attack detection and recent advances

M Sahidullah, H Delgado, M Todisco, A Nautsch… - Handbook of Biometric …, 2023 - Springer
Over the past few years, significant progress has been made in the field of presentation
attack detection (PAD) for automatic speaker verification (ASV). This includes the …

One-class learning towards synthetic voice spoofing detection

Y Zhang, F Jiang, Z Duan - IEEE Signal Processing Letters, 2021 - ieeexplore.ieee.org
Human voices can be used to authenticate the identity of the speaker, but the automatic
speaker verification (ASV) systems are vulnerable to voice spoofing attacks, such as …

A comparative study on recent neural spoofing countermeasures for synthetic speech detection

X Wang, J Yamagishi - arXiv preprint arXiv:2103.11326, 2021 - arxiv.org
A great deal of recent research effort on speech spoofing countermeasures has been
invested into back-end neural networks and training criteria. We contribute to this effort with …

Towards end-to-end synthetic speech detection

G Hua, ABJ Teoh, H Zhang - IEEE Signal Processing Letters, 2021 - ieeexplore.ieee.org
The constant Q transform (CQT) has been shown to be one of the most effective speech
signal pre-transforms to facilitate synthetic speech detection, followed by either hand-crafted …

End-to-end text-independent speaker verification with triplet loss on short utterances

C Zhang, K Koishida - Interspeech, 2017 - isca-archive.org
Text-independent speaker verification against short utterances is still challenging despite
recent advances in the field of speaker recognition with i-vector framework. In general, to get …

Text-independent speaker verification based on triplet convolutional neural network embeddings

C Zhang, K Koishida… - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org
The effectiveness of introducing deep neural networks into conventional speaker recognition
pipelines has been broadly shown to benefit system performance. A novel text-independent …

Evaluation of an audio-video multimodal deepfake dataset using unimodal and multimodal detectors

H Khalid, M Kim, S Tariq, SS Woo - Proceedings of the 1st workshop on …, 2021 - dl.acm.org
Significant advancements made in the generation of deepfakes have caused security and
privacy issues. Attackers can easily impersonate a person's identity in an image by replacing …

Multimodaltrace: Deepfake detection using audiovisual representation learning

MA Raza, KM Malik - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
By employing generative deep learning techniques, Deepfakes are created with the intent to
create mistrust in society, manipulate public opinion and political decisions, and for other …

Advances in anti-spoofing: from the perspective of ASVspoof challenges

MR Kamble, HB Sailor, HA Patil, H Li - APSIPA Transactions on …, 2020 - cambridge.org
In recent years, automatic speaker verification (ASV) has been used extensively for voice biometrics.
This has led to increased interest in securing these voice biometric systems for real-world …

Synthetic speech detection through short-term and long-term prediction traces

C Borrelli, P Bestagini, F Antonacci, A Sarti… - EURASIP Journal on …, 2021 - Springer
Several methods for synthetic audio speech generation have been developed in the
literature through the years. With the great technological advances brought by deep …