- Academic resource search

Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Cited by 390

A review of speaker diarization: Recent advances with deep learning

TJ Park, N Kanda, D Dimitriadis, KJ Han… - Computer Speech & …, 2022 - Elsevier

Speaker diarization is a task to label audio or video recordings with classes that correspond
to speaker identity, or in short, a task to identify “who spoke when”. In the early years …

Cited by 365

Speaker diarization: A review of recent research

X Anguera, S Bozonnet, N Evans… - … on audio, speech …, 2012 - ieeexplore.ieee.org

Speaker diarization is the task of determining “who spoke when?” in an audio or video
recording that contains an unknown amount of speech and also an unknown number of …

Cited by 897

[Book] Speaker recognition

H Beigi - 2011 - Springer

The objective of the enrollment process is to modify (adapt) a speaker-independent model
into one that best characterizes the target speaker's vocal tract characteristics. Depending …

Cited by 592

[PDF] Speaker, environment and channel change detection and clustering via the Bayesian information criterion

S Chen, P Gopalakrishnan - Proc. DARPA broadcast news transcription …, 1998 - Citeseer

In this paper, we are interested in detecting changes in speaker identity, environmental
condition and channel condition; we call this the problem of acoustic change detection. The …

Cited by 1237

An overview of automatic speaker diarization systems

SE Tranter, DA Reynolds - IEEE Transactions on audio, speech …, 2006 - ieeexplore.ieee.org

Audio diarization is the process of annotating an input audio channel with information that
attributes (possibly overlapping) temporal regions of signal energy to their specific sources …

Cited by 822

A sticky HDP-HMM with application to speaker diarization

EB Fox, EB Sudderth, MI Jordan, AS Willsky - The Annals of Applied …, 2011 - JSTOR

We consider the problem of speaker diarization, the problem of segmenting an audio
recording of a meeting into temporal segments corresponding to individual speakers. The …

Cited by 490

[Book] MPEG-7 audio and beyond: Audio content indexing and retrieval

HG Kim, N Moreau, T Sikora - 2006 - books.google.com

Advances in technology, such as MP3 players, the Internet and DVDs, have led to the
production, storage and distribution of a wealth of audio signals, including speech, music …

Cited by 454

The LIMSI broadcast news transcription system

JL Gauvain, L Lamel, G Adda - Speech communication, 2002 - Elsevier

This paper reports on activities at LIMSI over the last few years directed at the transcription of
broadcast news data. We describe our development work in moving from laboratory read …

Cited by 550

An open-source state-of-the-art toolbox for broadcast news diarization

M Rouvier, G Dupuy, P Gay, E Khoury, T Merlin… - Interspeech, 2013 - hal.science

This paper presents the LIUM open-source speaker diarization toolbox, mostly dedicated to
broadcast news. This tool includes both Hierarchical Agglomerative Clustering using well …

Cited by 207