Audio Anti-Spoofing Detection: A Survey

M Li, Y Ahmadiadli, XP Zhang - arXiv preprint arXiv:2404.13914, 2024 - arxiv.org
The availability of smart devices leads to an exponential increase in multimedia content.
However, the rapid advancements in deep learning have given rise to sophisticated …

Slim: Style-linguistics mismatch model for generalized audio deepfake detection

Y Zhu, S Koppisetti, T Tran, G Bharaj - arXiv preprint arXiv:2407.18517, 2024 - arxiv.org
Audio deepfake detection (ADD) is crucial to combat the misuse of speech synthesized from
generative AI models. Existing ADD models suffer from generalization issues, with a large …

One-class knowledge distillation for spoofing speech detection

J Lu, Y Zhang, W Wang, Z Shang… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
The detection of spoofing speech generated by unseen algorithms remains an unresolved
challenge. One reason for the lack of generalization ability is that traditional detecting …

One-class learning with adaptive centroid shift for audio deepfake detection

HM Kim, K Jang, H Kim - arXiv preprint arXiv:2406.16716, 2024 - arxiv.org
As speech synthesis systems continue to make remarkable advances in recent years, the
importance of robust deepfake detection systems that perform well in unseen systems has …

Beyond silence: Bias analysis through loss and asymmetric approach in audio anti-spoofing

H Shim, M Sahidullah, J Jung, S Watanabe… - arXiv preprint arXiv …, 2024 - arxiv.org
Current trends in audio anti-spoofing detection research strive to improve models' ability to
generalize across unseen attacks by learning to identify a variety of spoofing artifacts. This …

Temporal Variability and Multi-Viewed Self-Supervised Representations to Tackle the ASVspoof5 Deepfake Challenge

Y Xie, X Wang, Z Wang, R Fu, Z Wen, H Cheng… - arXiv preprint arXiv …, 2024 - arxiv.org
ASVspoof5, the fifth edition of the ASVspoof series, is one of the largest global audio security
challenges. It aims to advance the development of countermeasure (CM) to discriminate …

Generalizable Speech Spoofing Detection Against Silence Trimming with Data Augmentation and Multi-task Meta-Learning

L Wang, L Yu, Y Zhang, H Xie - IEEE/ACM Transactions on …, 2024 - ieeexplore.ieee.org
A major difficulty in speech spoofing detection lies in improving the generalization ability to
detect unknown forgery methods. However, most previous methods do not consider the …

PULMO: Precise utterance-level modeling for speech anti-spoofing

S Yoon - Applied Acoustics, 2025 - Elsevier
In recent years, most state-of-the-art approaches for spoofed speech detection have been
based on convolutional neural networks (CNNs). Most neural networks, including CNNs, are …

What's the Real: A Novel Design Philosophy for Robust AI-Synthesized Voice Detection

X Hai, X Liu, Y Tan, G Liu, S Li, W Niu, R Zhou… - Proceedings of the …, 2024 - dl.acm.org
Voice is one of the most widely used media for information transmission in human society.
While high-quality synthetic voices are extensively utilized in various applications, they pose …

How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio?

T Liu, L Zhang, RK Das, Y Ma, R Tao, H Li - arXiv preprint arXiv …, 2024 - arxiv.org
Partially manipulating a sentence can greatly change its meaning. Recent work shows that
countermeasures (CMs) trained on partially spoofed audio can effectively detect such …