Speaker recognition from raw waveform with sincnet

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier

The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

被引用次数：99 相关文章所有 6 个版本

[HTML] peerj.com

[HTML][HTML] Computational bioacoustics with deep learning: a review and roadmap

D Stowell - PeerJ, 2022 - peerj.com

Animal vocalisations and natural soundscapes are fascinating objects of study, and contain
valuable evidence about animal behaviours, populations and ecosystems. They are studied …

被引用次数：194 相关文章所有 10 个版本

[PDF] arxiv.org

Curriculum learning: A survey

P Soviany, RT Ionescu, P Rota, N Sebe - International Journal of …, 2022 - Springer

Training machine learning models in a meaningful order, from the easy samples to the hard
ones, using curriculum learning can provide performance improvements over the standard …

被引用次数：308 相关文章所有 10 个版本

[PDF] arxiv.org

Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

被引用次数：343 相关文章所有 9 个版本

[PDF] arxiv.org

Aasist: Audio anti-spoofing using integrated spectro-temporal graph attention networks

J Jung, HS Heo, H Tak, H Shim… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Artefacts that differentiate spoofed from bona-fide utterances can reside in specific temporal
or spectral intervals. Their reliable detection usually depends upon computationally …

被引用次数：209 相关文章所有 12 个版本

[PDF] arxiv.org

End-to-end anti-spoofing with rawnet2

H Tak, J Patino, M Todisco, A Nautsch… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

Spoofing countermeasures aim to protect automatic speaker verification systems from being
manipulated by spoofed speech signals. While results from the most recent ASVspoof 2019 …

被引用次数：285 相关文章所有 12 个版本

[PDF] arxiv.org

In defence of metric learning for speaker recognition

JS Chung, J Huh, S Mun, M Lee, HS Heo… - arXiv preprint arXiv …, 2020 - arxiv.org

The objective of this paper is' open-set'speaker recognition of unseen speakers, where ideal
embeddings should be able to condense information into a compact utterance-level …

被引用次数：470 相关文章所有 11 个版本

[PDF] arxiv.org

Deep learning for audio signal processing

H Purwins, B Li, T Virtanen, J Schlüter… - IEEE Journal of …, 2019 - ieeexplore.ieee.org

Given the recent surge in developments of deep learning, this paper provides a review of the
state-of-the-art deep learning techniques for audio signal processing. Speech, music, and …

被引用次数：800 相关文章所有 7 个版本

[HTML] springer.com

[HTML][HTML] Recent advances in the application of deep learning for fault diagnosis of rotating machinery using vibration signals

BA Tama, M Vania, S Lee, S Lim - Artificial Intelligence Review, 2023 - Springer

Vibration measurement and monitoring are essential in a wide variety of applications.
Vibration measurements are critical for diagnosing industrial machinery malfunctions …

被引用次数：76 相关文章所有 8 个版本

[PDF] arxiv.org

Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation

H Tak, M Todisco, X Wang, J Jung, J Yamagishi… - arXiv preprint arXiv …, 2022 - arxiv.org

The performance of spoofing countermeasure systems depends fundamentally upon the use
of sufficiently representative training data. With this usually being limited, current solutions …

被引用次数：106 相关文章所有 9 个版本