Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Online end-to-end neural diarization with speaker-tracing buffer

Y Xue, S Horiguchi, Y Fujita… - 2021 IEEE Spoken …, 2021 - ieeexplore.ieee.org
This paper proposes a novel online speaker diarization algorithm based on a fully
supervised self-attention mechanism (SA-EEND). Online diarization inherently presents a …

Online neural diarization of unlimited numbers of speakers using global and local attractors

S Horiguchi, S Watanabe, P García… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
A method to perform offline and online speaker diarization for an unlimited number of
speakers is described in this paper. End-to-end neural diarization (EEND) has achieved …

Development of supervised speaker diarization system based on the pyannote audio processing library

V Khoma, Y Khoma, V Brydinskyi, A Konovalov - Sensors, 2023 - mdpi.com
Diarization is an important task when work with audiodata is executed, as it provides a
solution to the problem related to the need of dividing one analyzed call recording into …

Online streaming end-to-end neural diarization handling overlapping speech and flexible numbers of speakers

Y Xue, S Horiguchi, Y Fujita, Y Takashima… - arXiv preprint arXiv …, 2021 - arxiv.org
We propose a streaming diarization method based on an end-to-end neural diarization
(EEND) model, which handles flexible numbers of speakers and overlapping speech. In our …

Meta-learning with latent space clustering in generative adversarial network for speaker diarization

M Pal, M Kumar, R Peri, TJ Park, SH Kim… - … ACM transactions on …, 2021 - ieeexplore.ieee.org
The performance of most speaker diarization systems with x-vector embeddings is both
vulnerable to noisy environments and lacks domain robustness. Earlier work on speaker …

[PDF][PDF] Online Speaker Diarization with Core Samples Selection.

Y Yue, J Du, MK He, YT Yeung, R Wang - INTERSPEECH, 2022 - isca-archive.org
We propose a novel online speaker diarization approach based on the VBx algorithm which
works well on the offline speaker diarization tasks. To efficiently process long-time …

End-to-end Online Speaker Diarization with Target Speaker Tracking

W Wang, M Li - arXiv preprint arXiv:2310.08696, 2023 - arxiv.org
This paper proposes an online target speaker voice activity detection system for speaker
diarization tasks, which does not require a priori knowledge from the clustering-based …

Online target speaker voice activity detection for speaker diarization

W Wang, Q Lin, M Li - arXiv preprint arXiv:2207.05920, 2022 - arxiv.org
This paper proposes an online target speaker voice activity detection system for speaker
diarization tasks, which does not require a priori knowledge from the clustering-based …

Hybrid speech and text analysis methods for speaker change detection

OH Anidjar, I Lapidot, C Hajaj, A Dvir… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
Speaker Change Detection (SCD) is the task of segmenting an input audio-recording
according to speaker interchanges. Nowadays, many applications, such as Speaker …