Speaker diarization: A review of recent research

X Anguera, S Bozonnet, N Evans… - … on audio, speech …, 2012 - ieeexplore.ieee.org
Speaker diarization is the task of determining “who spoke when?” in an audio or video
recording that contains an unknown amount of speech and also an unknown number of …

An information theoretic approach to speaker diarization of meeting data

D Vijayasenan, F Valente… - IEEE transactions on …, 2009 - ieeexplore.ieee.org
A speaker diarization system based on an information theoretic framework is described. The
problem is formulated according to the information bottleneck (IB) principle. Unlike other …

A comparative study of bottom-up and top-down approaches to speaker diarization

N Evans, S Bozonnet, D Wang… - … on Audio, speech …, 2012 - ieeexplore.ieee.org
This paper presents a theoretical framework to analyze the relative merits of the two most
general, dominant approaches to speaker diarization involving bottom-up and top-down …

Speaker diarization and linking of meeting data

M Ferras, S Madikeri, H Bourlard - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org
Finding who spoke when in a collection of recordings, with speakers being uniquely
identified across the database, is a challenging task. In this scenario, reasonable computing …

System output combination for improved speaker diarization

S Bozonnet, N Evans, X Anguera, O Vinyals… - … 2010, September 26 …, 2010 - hal.science
System combination or fusion is a popular, successful and sometimes
straightforward means of improving performance in many fields of statistical pattern …

System fusion and speaker linking for longitudinal diarization of TV shows

M Ferras, S Madikeri, P Motlicek… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
Performing speaker diarization while uniquely identifying the speakers in a collection of
audio recordings is a challenging task. Based on our previous work on speaker diarization …

An information theoretic approach to speaker diarization of meeting recordings

D Vijayasenan - 2010 - infoscience.epfl.ch
In this thesis we investigate a non-parametric approach to speaker diarization for meeting
recordings based on an information theoretic framework. The problem is formulated using …

Analysis of phonetic dependence of segmentation errors in speaker diarization

SW McKnight, AOT Hogg… - 2020 28th European …, 2021 - ieeexplore.ieee.org
Evaluation of speaker segmentation and diarization normally makes use of forgiveness
collars around ground truth speaker segment boundaries such that estimated speaker …

Speaker Clustering of Stereo Audio Documents Based on Sequential Gathering Process.

H Sayoud, S Ouamour - J. Inf. Hiding Multim. Signal Process., 2010 - bit.kuas.edu.tw
This paper focuses on the use of sequential speaker clustering of stereo audio documents to
obtain a classification of the different speech segments contained in those documents …

An integrated top-down/bottom-up approach to speaker diarization

S Bozonnet, N Evans, C Fredouille, D Wang… - … 2010, September 26 …, 2010 - hal.science
Most speaker diarization systems fit into one of two categories: bottom-up or top-down.
Bottom-up systems are the most popular but can sometimes suffer from instability from …