Audio deepfake detection: A survey

J Yi, C Wang, J Tao, X Zhang, CY Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Audio deepfake detection is an emerging active topic. A growing number of literatures have
aimed to study deepfake detection algorithms and achieved effective performance, the …

Deepfakes as a threat to a speaker and facial recognition: An overview of tools and attack vectors

A Firc, K Malinka, P Hanáček - Heliyon, 2023 - cell.com
Deepfakes present an emerging threat in cyberspace. Recent developments in machine
learning make deepfakes highly believable, and very difficult to differentiate between what is …

Learning from yourself: A self-distillation method for fake speech detection

J Xue, C Fan, J Yi, C Wang, Z Wen… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
In this paper, we propose a novel self-distillation method for fake speech detection (FSD),
which can significantly improve the performance of FSD without increasing the model …

From Audio Deepfake Detection to AI-Generated Music Detection--A Pathway and Overview

Y Li, M Milling, L Specia, BW Schuller - arXiv preprint arXiv:2412.00571, 2024 - arxiv.org
As Artificial Intelligence (AI) technologies continue to evolve, their use in generating realistic,
contextually appropriate content has expanded into various domains. Music, an art form and …

Detection of cross-dataset fake audio based on prosodic and pronunciation features

C Wang, J Yi, J Tao, C Zhang, S Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Existing fake audio detection systems perform well in in-domain testing, but still face many
challenges in out-of-domain testing. This is due to the mismatch between the training and …

A lightweight feature extraction technique for deepfake audio detection

N Chakravarty, M Dua - Multimedia Tools and Applications, 2024 - Springer
The emergence of audio deepfakes has prompted concerns over reputational integrity and
dependability. Deepfakes with audio can now be produced more easily, which makes it …

Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection

C Fan, M Ding, J Tao, R Fu, J Yi… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Most research in synthetic speech detection (SSD) focuses on improving performance on
standard noise-free datasets. However, in actual situations, noise interference is usually …

Detecting Audio Deepfakes: Integrating CNN and BiLSTM with Multi-Feature Concatenation

TM Wani, SAA Qadri, D Comminiello… - Proceedings of the 2024 …, 2024 - dl.acm.org
Audio deepfake detection is emerging as a crucial field in digital media, as distinguishing
real audio from deepfakes becomes increasingly challenging due to the advancement of …

Multi-perspective Information Fusion Res2Net with RandomSpecmix for Fake Speech Detection

S Dong, J Xue, C Fan, K Zhu, Y Chen, Z Lv - arXiv preprint arXiv …, 2023 - arxiv.org
In this paper, we propose the multi-perspective information fusion (MPIF) Res2Net with
random Specmix for fake speech detection (FSD). The main purpose of this system is to …

ABC-CapsNet: Attention based Cascaded Capsule Network for Audio Deepfake Detection

TM Wani, R Gulzar, I Amerini - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
In response to the escalating challenge of audio deepfake detection this study introduces
ABC-CapsNet (Attention-Based Cascaded Capsule Network) a novel architecture that …