Speaker recognition based on deep learning: An overview
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …
learning has dramatically revolutionized speaker recognition. However, there is lack of …
A review of speaker diarization: Recent advances with deep learning
Speaker diarization is a task to label audio or video recordings with classes that correspond
to speaker identity, or in short, a task to identify “who spoke when”. In the early years …
to speaker identity, or in short, a task to identify “who spoke when”. In the early years …
Speaker diarization: A review of recent research
Speaker diarization is the task of determining “who spoke when?” in an audio or video
recording that contains an unknown amount of speech and also an unknown number of …
recording that contains an unknown amount of speech and also an unknown number of …
[图书][B] Speaker recognition
H Beigi, H Beigi - 2011 - Springer
The objective of the enrollment process is to modify (adapt) a speaker-independent model
into one that best characterizes the target speaker's vocal tract characteristics. Depending …
into one that best characterizes the target speaker's vocal tract characteristics. Depending …
[PDF][PDF] Speaker, environment and channel change detection and clustering via the bayesian information criterion
S Chen, P Gopalakrishnan - Proc. DARPA broadcast news transcription …, 1998 - Citeseer
In this paper, we are interested in detecting changes in speaker identity, environmental
condition and channel condition; we call this the problem of acoustic change detection. The …
condition and channel condition; we call this the problem of acoustic change detection. The …
An overview of automatic speaker diarization systems
SE Tranter, DA Reynolds - IEEE Transactions on audio, speech …, 2006 - ieeexplore.ieee.org
Audio diarization is the process of annotating an input audio channel with information that
attributes (possibly overlapping) temporal regions of signal energy to their specific sources …
attributes (possibly overlapping) temporal regions of signal energy to their specific sources …
A sticky HDP-HMM with application to speaker diarization
We consider the problem of speaker diarization, the problem of segmenting an audio
recording of a meeting into temporal segments corresponding to individual speakers. The …
recording of a meeting into temporal segments corresponding to individual speakers. The …
[图书][B] MPEG-7 audio and beyond: Audio content indexing and retrieval
Advances in technology, such as MP3 players, the Internet and DVDs, have led to the
production, storage and distribution of a wealth of audio signals, including speech, music …
production, storage and distribution of a wealth of audio signals, including speech, music …
The LIMSI broadcast news transcription system
This paper reports on activites at LIMSI over the last few years directed at the transcription of
broadcast news data. We describe our development work in moving from laboratory read …
broadcast news data. We describe our development work in moving from laboratory read …
An open-source state-of-the-art toolbox for broadcast news diarization
This paper presents the LIUM open-source speaker diarization toolbox, mostly dedicated to
broadcast news. This tool includes both Hierarchical Agglomerative Clustering using well …
broadcast news. This tool includes both Hierarchical Agglomerative Clustering using well …