[HTML][HTML] An experimental review of speaker diarization methods with application to two-speaker conversational telephone speech recordings

L Serafini, S Cornell, G Morrone, E Zovato… - Computer Speech & …, 2023 - Elsevier
We performed an experimental review of current diarization systems for the conversational
telephone speech (CTS) domain. In detail, we considered a total of eight different algorithms …

[PDF][PDF] Robust entropy-based endpoint detection for speech recognition in noisy environments.

J Shen, J Hung, L Lee - ICSLP, 1998 - labrosa.ee.columbia.edu
This paper presents an entropy-based algorithm for accurate and robust endpoint detection
for speech recognition under noisy environments. Instead of using the conventional energy …

NetSTAT: A network-based intrusion detection approach

G Vigna, RA Kemmerer - Proceedings 14th Annual Computer …, 1998 - ieeexplore.ieee.org
Network-based attacks have become common and sophisticated. For this reason, intrusion
detection systems are now shifting their focus from the hosts and their operating systems to …

Spot me if you can: Uncovering spoken phrases in encrypted voip conversations

CV Wright, L Ballard, SE Coull… - … IEEE Symposium on …, 2008 - ieeexplore.ieee.org
Despite the rapid adoption of Voice over IP (VoIP), its security implications are not yet fully
understood. Since VoIP calls may traverse untrusted networks, packets should be encrypted …

Towards a practical lipreading system

Z Zhou, G Zhao, M Pietikäinen - CVPR 2011, 2011 - ieeexplore.ieee.org
A practical lipreading system can be considered either as subject dependent (SD) or subject-
independent (SI). An SD system is user-specific, ie, customized for some particular user …

Robust endpoint detection algorithm based on the adaptive band-partitioning spectral entropy in adverse environments

BF Wu, KC Wang - IEEE Transactions on speech and audio …, 2005 - ieeexplore.ieee.org
In speech processing, endpoint detection in noisy environments is difficult, especially in the
presence of nonstationary noise. Robust endpoint detection is one of the most important …

[PDF][PDF] Entropy based voice activity detection in very noisy conditions.

P Renevey, A Drygajlo - INTERSPEECH, 2001 - researchgate.net
This paper addresses the problem of robust voice activity detection (VAD) capable for
working at very low signal-to-noise ratios (SNR< 10dB). A new algorithm that we propose is …

An end-to-end multimodal voice activity detection using wavenet encoder and residual networks

I Ariav, I Cohen - IEEE Journal of Selected Topics in Signal …, 2019 - ieeexplore.ieee.org
Recently, there has been growing use of deep neural networks in many modern speech-
based systems such as speaker recognition, speech enhancement, and emotion …

A novel approach to robust speech endpoint detection in car environments

L Huang, C Yang - 2000 IEEE International Conference on …, 2000 - ieeexplore.ieee.org
In the process of speech recognition, it is especially crucial to precisely locate endpoints of
the input utterance to be free of non-speech regions. This paper proposes a novel approach …

System and method for multimodal utterance detection

AP Rao - US Patent 9,922,640, 2018 - Google Patents
The disclosure describe a system and method for detecting one or more segments of
desired speech utterances from an audio stream using timings of events from other modes …