Speaker diarization: A review of recent research

X Anguera, S Bozonnet, N Evans… - … on audio, speech …, 2012 - ieeexplore.ieee.org
Speaker diarization is the task of determining “who spoke when?” in an audio or video
recording that contains an unknown amount of speech and also an unknown number of …

A review on speaker diarization systems and approaches

MH Moattar, MM Homayounpour - Speech Communication, 2012 - Elsevier
Speaker indexing or diarization is an important task in audio processing and retrieval.
Speaker diarization is the process of labeling a speech signal with labels corresponding to …

[PDF][PDF] The EPAC corpus: Manual and automatic annotations of conversational speech in french broadcast news.

Y Esteve, T Bazillon, JY Antoine, F Béchet, J Farinas - LREC, 2010 - researchgate.net
This paper presents the EPAC corpus which is composed by a set of 100 hours of
conversational speech manually transcribed and by the outputs of automatic tools …

Hybrid speech and text analysis methods for speaker change detection

OH Anidjar, I Lapidot, C Hajaj, A Dvir… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
Speaker Change Detection (SCD) is the task of segmenting an input audio-recording
according to speaker interchanges. Nowadays, many applications, such as Speaker …

System and method for cluster-based audio event detection

E Khoury, M Garland - US Patent 10,141,009, 2018 - Google Patents
Methods, systems, and apparatuses for audio event detection, where the determination of a
type of sound data is made at the cluster level rather than at the frame level. The techniques …

Audiovisual diarization of people in video content

E El Khoury, C Sénac, P Joly - Multimedia tools and applications, 2014 - Springer
Abstract Audio-Visual People Diarization (AVPD) is an original framework that
simultaneously improves audio, video, and audiovisual diarization results. Following a …

The IMMED project: wearable video monitoring of people with age dementia

R Mégret, V Dovgalecs, H Wannous… - Proceedings of the 18th …, 2010 - dl.acm.org
In this paper, we describe a new application for multimedia indexing, using a system that
monitors the instrumental activities of daily living to assess the cognitive decline caused by …

Face-and-clothing based people clustering in video content

E El Khoury, C Sénac, P Joly - Proceedings of the international …, 2010 - dl.acm.org
Content-based people clustering is a crucial step for people indexing within video
documents. In this paper, we investigate the use of both face and clothing features. A …

Spatial features selection for unsupervised speaker segmentation and clustering

B Martínez-González, JM Pardo… - Expert Systems with …, 2017 - Elsevier
The selection of the best features to be used in expert systems is a key issue in obtaining a
satisfactory performance. Unsupervised speaker segmentation and clustering is the task of …

Methods for creating and searching a database of speakers

W Jeon, YM Cheng, C Ma, D Macho - US Patent 8,442,823, 2013 - Google Patents
(57) ABSTRACT A method of performing a search of a database of speakers, includes:
receiving a query speech sample spoken by a query speaker, deriving a query utterance …