A review of deep learning techniques for speech processing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

[HTML][HTML] Computational bioacoustics with deep learning: a review and roadmap

D Stowell - PeerJ, 2022 - peerj.com
Animal vocalisations and natural soundscapes are fascinating objects of study, and contain
valuable evidence about animal behaviours, populations and ecosystems. They are studied …

Curriculum learning: A survey

P Soviany, RT Ionescu, P Rota, N Sebe - International Journal of …, 2022 - Springer
Training machine learning models in a meaningful order, from the easy samples to the hard
ones, using curriculum learning can provide performance improvements over the standard …

Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Aasist: Audio anti-spoofing using integrated spectro-temporal graph attention networks

J Jung, HS Heo, H Tak, H Shim… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Artefacts that differentiate spoofed from bona-fide utterances can reside in specific temporal
or spectral intervals. Their reliable detection usually depends upon computationally …

End-to-end anti-spoofing with rawnet2

H Tak, J Patino, M Todisco, A Nautsch… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Spoofing countermeasures aim to protect automatic speaker verification systems from being
manipulated by spoofed speech signals. While results from the most recent ASVspoof 2019 …

In defence of metric learning for speaker recognition

JS Chung, J Huh, S Mun, M Lee, HS Heo… - arXiv preprint arXiv …, 2020 - arxiv.org
The objective of this paper is' open-set'speaker recognition of unseen speakers, where ideal
embeddings should be able to condense information into a compact utterance-level …

Deep learning for audio signal processing

H Purwins, B Li, T Virtanen, J Schlüter… - IEEE Journal of …, 2019 - ieeexplore.ieee.org
Given the recent surge in developments of deep learning, this paper provides a review of the
state-of-the-art deep learning techniques for audio signal processing. Speech, music, and …

[HTML][HTML] Recent advances in the application of deep learning for fault diagnosis of rotating machinery using vibration signals

BA Tama, M Vania, S Lee, S Lim - Artificial Intelligence Review, 2023 - Springer
Vibration measurement and monitoring are essential in a wide variety of applications.
Vibration measurements are critical for diagnosing industrial machinery malfunctions …

Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation

H Tak, M Todisco, X Wang, J Jung, J Yamagishi… - arXiv preprint arXiv …, 2022 - arxiv.org
The performance of spoofing countermeasure systems depends fundamentally upon the use
of sufficiently representative training data. With this usually being limited, current solutions …